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Abstract. We consider homogeneous factor models on uniformly sparse graph sequences 
converging locally to a (unimodular) random tree T, and study the existence of the free 
energy density 4>, the limit of the log-partition function divided by the number of vertices n 
as n tends to infinity. We provide a new interpolation scheme and use it to prove existence 
of, and to explicitly compute, the quantity <j> subject to uniqueness of a relevant Gibbs 
measure for the factor model on T. By way of example we compute <fi for the independent 
set (or hard-core) model at low fugacity, for the ferromagnetic Ising model at all parameter 
values, and for the ferromagnetic Potts model with both weak enough and strong enough 
interactions. Even beyond uniqueness regimes our interpolation provides useful explicit 
bounds on 4>. 

In the regimes in which we establish existence of the limit, we show that it coincides with 
the Bethe free energy functional evaluated at a suitable fixed point of the belief propagation 
(Bcthe) recursions on T. In the special case that T has a Galton- Watson law, this formula 
coincides with the non-rigorous "Bethe prediction" obtained by statistical physicists using 
the "replica" or "cavity" methods. Thus our work is a rigorous generalization of these 
heuristic calculations to the broader class of sparse graph sequences converging locally to 
trees. We also provide a variational characterization for the Bethe prediction in this general 
setting, which is of independent interest. 



Let G = (V, E) be a finite undirected graph, and a finite alphabet of spins. A factor 
model on G is a probability measure on the space of (spin) configurations a G !% v of form 



where ip = ip 13 is a symmetric function J?T 2 — > R>o parametrized by (3 G R, tp = ip B is 
a positive function E£ — > M>o parametrized by B G M, and Zg,±{@, B) is the normalizing 
constant, called the partition function (with its logarithm called the free energy). The pair 
t/j = (/0,'0) is called a specification for the factor model (1-1). 

In this paper we study the asymptotics of the free energy for sequences of (random) 
graphs G n = (V n = [n],E n ) in the thermodynamic limit n — > oo. More precisely, with 

Date: July 24, 2012. 

2010 Mathematics Subject Classification. 05C80, 60K35, 82B20, 82B23. 

Key words and phrases. Factor models, random graphs, belief propagation, Bethe measures, Potts model, 
independent set, Gibbs measures, free energy density, local weak convergence. 

Research partially supported by NSF grants *t*DMS-1106627 and tCCF-0743978, and *Dept. of Defense 
NDSEG Fellowship. 



1. Introduction 




1 




i 



2 



A. DEMBO, A. MONTANARI, AND N. SUN 



Z n (/3, B) = Z Gni ^(f3, B) and E n denoting expectation with respect to the law of G n , we seek 
to establish the existence of the free energy density 

(j)((3,B) = lim <j> n tf,B), where <j> n tf,B) = -E n [log Z n (/3, B)], (1.2) 

n— >oo jt, 

and to determine its value. (In the literature, 0(/3, S) is also referred to as the "free entropy 
density" or "pressure.") 

The primary example we consider is the q-Potts model on G with inverse temperature 
and magnetic field B, the probability measure on S£ v (with = [q\ = {1, . . . , g}) given by 

^ B (o) = z J^ m w{p E - + (L3) 



G 



(ij)eE iev 

The model is said to be ferromagnetic if (5 > 0, anti-ferromagnetic otherwise. (In analogy 
with the Potts model, in the general factor model setting we continue to refer to /3 as the 
interaction or temperature parameter and to B as the magnetic field.) 

Potts models have been intensively studied in statistical mechanics because of their key 
role in the theory of phase transitions [43], critical phenomena [46], and conformally invari- 
ant scaling limits [35]. As demonstrated for instance in [32] for the Ising model (i.e. the 
Potts model with q = 2), determining the limit (1.2) plays a key role in characterizing the 

8 B 

asymptotic structure of the measures Uq in the thermodynamic limit. Potts models are 
also of great interest in combinatorics: recall in fact that the partition function admits a 
random-cluster representation ([14, 21]; see also §4.2), which at B — reads 

«o) = J](^-i)iV F) , 

FCE 

with k(F) denoting the number of connected components induced by the subset of edges 
F C E (cf. (4.2)). Up to a multiplicative constant this coincides with the Tutte polynomial 
Tg(%, y) of G evaluated at x — 1 + q{e^ — 1) _1 , y = e 13 (see e.g. [40]). 

Mathematical statistical mechanics has focused so far on specific graph sequences G n , 
e.g. on finite exhaustions of the rectangular grid or other regular lattices in d dimensions 
with d fixed. Under mild conditions on the sequence, existence of the free energy density 
is a consequence of the following well-known argument (see e.g. [36, Propn. 2.3.2]): each 
graph G n can be decomposed into smaller blocks by deleting a collection of edges whose 
number is negligible in comparison with the volume. Consequently the sequence log Zq k is 
approximately sub-additive in n, implying existence of the limit (see [24]). 

In this paper we consider sparse graphs with a locally tree-like structure — formally, graph 
sequences G n converging locally weakly to (random) trees; see Defn. 1.1 below. Although 
the study of statistical mechanics "beyond Z d " is not directly motivated by physics consid- 
erations, physicists have been interested in models on alternative graph structures for a long 
time (an early example being [12]). Moreover, the study of factor models on sparse graphs 
has many motivations coming from computer science and statistical inference (see [8, 31]). 
Indeed, another example we will consider is the independent set or hard-core model on G 
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with fugacity A > 0, the probability measure on S£ v (with = {0, 1}) given by 

(where B = log A is the magnetic field). This model always has anti- ferromagnetic interac- 
tions, and is of significant interest in computer science. The independent set decision problem 
is NP-complete (via the clique decision problem [7, 26]). As A increases the measure Uq be- 
comes increasingly concentrated on the maximal independent sets; the optimization problem 
of finding such sets is NP-hard [28] and hard to approximate ([47] and references therein). 
The problem of counting independent sets (i.e. computing Zq{1)) for graphs of maximum 
degree A is ^P-complete for A > 3 ([20] and references therein). Although there exists a 
PTAS (polynomial-time approximation scheme) for Zq{X) for A below a certain "uniqueness 
threshold" [42], a series of previous works (see [33, 38, 17] and references therein) gave strong 
evidence that computation is hard for any A above this threshold. This question was resolved 
simultaneously in the subsequent works [18, 39], with [39] building on methods from this 
paper. 

Since infinite trees are non-amenable, G n cannot be decomposed by removing a vanishing 
fraction of edges, so the preceding argument no longer applies: in physics terms, surface 
effects are non-negligible even in the thermodynamic limit. Despite this, statistical physicists 
expect the free energy density (1.2) to exist on a large class of locally tree-like graphs. Even 
more surprisingly, employing non-rigorous but mathematically sophisticated heuristics such 
as the "replica" or "cavity" methods they derive exact formulae for this limit for a number 
of statistical mechanics models on locally tree-like graphs (see e.g. [31] and the references 
therein). The primary example considered in these works is the graph chosen uniformly at 
random from those with n vertices and m = m(n) edges, with m/n — > 7 G K; such graphs 
converge locally to the Galton- Watson tree with Pois(27) offspring distribution. The Galton- 
Watson tree with general offspring distribution can be obtained as the local weak limit of 
random graphs with specified degree profile corresponding to the offspring distribution; the 
physics heuristics extend to this and even more general settings. 

There is no good argument for why the limit (1.2) exists; the heuristic replica or cavity 
methods compute this limit starting from the postulate that it exists. A significant break- 
through was achieved by the interpolation method first developed by Guerra and Toninelli 
[23] for the Sherrington-Kirkpatrick model from spin-glass theory, and then generalized to 
a number of statistical physics models on sparse graphs [15, 16, 34] and related constraint 
satisfaction problems [4] . This method establishes super-additivity of log Zc n which implies 
existence of the limit (1.2). Unfortunately, this approach appears limited to models with 
repulsive interactions, i.e. in which higher weight is given to configurations in which neigh- 
boring vertices take different values. In particular, it does not apply to the ferromagnetic 
Potts model. This is especially puzzling because the heuristic physics predictions do not dis- 
tinguish between the two cases, and there is no fundamental reason why the limit should be 
computable in one case and not in the other. Further, this interpolation method only applies 
to very restricted classes of graph sequences (typically, uniformly random given the degree 
sequence); notably, existence of the limit is not proved for deterministic graph sequences. 
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Finally, the method gives no way to actually compute the limit, although interpolation has 
been used to prove upper bounds [15, 16, 34]. 

In this paper we follow a different approach relying only on local weak convergence of 
the graph sequence (G n ) n >i to some limiting (random) tree. The general idea is that the 
corresponding factor models (1.1) must converge (passing to a subsequence as needed), to 
a Gibbs measure on the limiting tree; the task then "reduces" to the one of identifying the 
correct limit. This is still a substantial challenge because, in general, there is an uncountable 
number of "candidate" Gibbs measures for the limit. Nevertheless, this program was carried 
through in [9] for Ising models on graphs converging locally to Galton- Watson tree, under a 
"uniform sparsity" assumption (Defn. 1.3), on the degree distribution (it is further assumed in 
[9] that the distribution has finite second moment; this condition was relaxed in [11], thereby 
handling the case of power law graphs). The result of [9, 11] provides also a fairly explicit 
expression Q((3,B) for the free energy density, defined solely in terms of the limiting tree. 
This expression coincides with the so-called "Bethe prediction" of statistical physics, derived 
earlier for random graphs with given degree distribution using the "replica" or "cavity" 
methods. 

We develop this approach here in more generality. Rather than considering a specific model 
such as the Ising, we establish results for general abstract factor models satisfying mild reg- 
ularity conditions (see (HI) below), covering in particular the Potts and independent set 
models. We also make no distributional assumptions on the graphs G n or the limiting ran- 
dom tree, other than some integrability conditions (see Defn. 1.3 and (H2) below). In this 
setting we develop a general interpolation scheme (Thm. 1.13) which, under appropriate 
assumptions, bounds differences 4> n ((3,B) in the limit n — > oo by differences in a functional 
$(/3,.B) (see (1.10)) defined solely in terms of the limiting tree. Although we continue to 
refer to this $(/3, B) as the "Bethe prediction," we remark that it is a considerable general- 
ization of earlier formulae obtained in the special case of Galton- Watson trees by statistical 
physics methods. It is defined as the evaluation of the "Bethe free energy functional" (1.9) 
at a specific Gibbs measure on the limiting tree, and corresponds to what physicists call 
the "replica symmetric solution": whereas it is expected to hold in the high-temperature 
regime (i.e. with small enough interactions), for many factor models it is incorrect at low 
temperature. However, we will show that in "uniqueness regimes," where the set of Gibbs 
measures on the limiting tree corresponding to the factor model specification ^ is a singleton, 
the upper and lower bounds of Thm. 1.13 match to completely verify the Bethe prediction 
(Thm. 1.14). 

We then apply our interpolation scheme to compute the free energy density in specific 
models. We verify the Bethe prediction for the independent set model with low fugacity 
(Thm. 1.11) as a consequence of Thm. 1.14. Further, by using monotonicity properties to 
restrict the set of relevant Gibbs measures, we obtain results for the Potts model going 
beyond the implications of Thm. 1.14: for q = 2 (Ising), we verify the Bethe prediction for 
all /? > 0, B 6 M (Thm. 1.8), extending the results of [9, 11] to general locally tree-like 
graph sequences. For general q, we verify the prediction in regimes of non-negative (j3, B) in 
which two specific Gibbs measures on the limiting tree coincide, namely, the Gibbs measures 
arising from free and 1 boundary conditions coincide (see Defn. 1.7 below). This condition 
is satisfied throughout the range {/3 > 0, B > 0} for q = 2; when q > 3 there are regimes of 
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non-uniqueness in which it fails, but we will show that it is satisfied both at (3 sufficiently 
small and sufficiently large, i.e. at high and low temperatures. 

Thm. 1 .13 can give useful bounds even beyond uniqueness regimes. As an illustration, 
we study the Potts model in the case that G n converges locally to the <i-regular tree T^. 
In Thm. 1.10 we explicitly characterize the non-uniqueness regime of this model and use 
Thm. 1.13 to give bounds for n (/3, B) within this regime. In a forthcoming work with Allan 
Sly we prove that in this setting, B) exists and matches the lower bound of Thm. 1.10. 
We also compute there the asymptotic free energy 0(A) (all A > 0) for the independent set 
model on <i-regular bipartite graphs. In contrast, for generic non-bipartite G n the consensus 
in physics is for a full replica symmetry breaking for large enough A, and consequently there 
does not exist even a heuristic prediction for the free energy density in this regime. 

As mentioned above, the Bethe prediction Q(f3,B) is the evaluation of the Bethe free 
energy functional at a specific Gibbs measure on the limiting tree. This Gibbs measure has 
a characterization in terms of "messages" h x ^ y = h(r,x-^y) defined on the directed edges 
x — > y of each tree T, such that the entire collection of messages is a fixed point of a certain 
"belief propagation" or "Bethe recursion" (1.14). Motivated by the finite-graph optimization 
of [44], we provide a variational characterization of the Bethe prediction (Thm. 1.16) which 
is of independent interest. In particular, this formulation suggests non-trivial connections 
with large deviation principles. 

1.1. Local weak convergence and the Bethe prediction. We study factor models on 
graphs which are "locally tree-like" in a sense which we now formalize, starting with a few 
notations and conventions. All graphs are taken to be undirected and locally finite. In a 
graph G = (V, E), let d denote graph distance, and for v G V write B t {y ) for the sub-graph 
of G induced by {w G V : d(v,w) < t}. Write v ~ w if v,w are neighbors in G, and write 
dv for the set of neighbors of v and D v = \dv\. Let Q. denote the space of isomorphism 
classes of (finite or infinite) rooted, connected graphs (G, o). A metric on this space is given 
by defining the distance between (G±, oi) and (C7 2 , o 2 ) in Q. to be 1/(1 + R) where R is the 
maximal r G Z> U {oo} such that Br(oi) = Br(c>2)] with this definition Q. is a complete 
separable metric space (see e.g. [1]). Let T. C Q. denote the closed subspace of (rooted) trees 
T = (T, o), the acyclic elements of Q.. We write T* for B t (o) in T, and in particular we 
use T° to denote the single-vertex tree. We now define the precise notion of graph limits 
considered throughout this paper. 

Definition 1.1. Let G n = (V n ,E n ) (n > 1) be a sequence of random graphs, and let /„ be 
a vertex chosen uniformly at random from V n . We say G n converges locally (weakly) to the 
random tree T if for each t > 0, B t (I n ) converges in law to T* in the space We say in 
this case that the G n are locally tree-like. 

We will make repeated use of the fact that local weak limits of graph sequences have 
the property of unimodularity, whose definition we recall here (for a detailed account see 
[1]). Let Q.. denote the space of isomorphism classes of bi-rooted, connected graphs with a 
distinguished ordered pair, denoted (G,i,j) (we do not require i ~ j); Q.. is metrizable in a 
similar manner as Q.. 
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Definition 1.2. A Borel probability measure \i on Q. is said to be unimodular if it obeys 
the mass-transport principle: 

E m[ =E^[ Y f( G > x >°)] V/ : 0.. K> Borel. (1.5) 

xev(G) ' xev(G) 

We say that \x is involution invariant if (1.5) holds when restricted to / supported only on 
those (G, x, y) with x ~ 

A measure /i on (/ is involution invariant if and only if it is unimodular [1, Propn. 2.2]. 
The concept of unimodularity first appeared in [5], where it was observed that local weak 
limits of graph sequences must be unimodular [5, §3.2]. The converse of this implication 
remains a well-known open question (see [1]). 

Definition 1.3. The graph sequence G n is uniformly sparse if the Dj n are uniformly inte- 
grable, that is, if 



lim lim sup E n [D In l{D In > L}) 
(where E n denotes expectation over the law of G n and /„). 

We assume throughout that G n {n > 1) is a uniformly sparse graph sequence converging 
locally weakly to the random tree T of (unimodular) law /i such that fi(D = 0) < 1; 
this entire setting is hereafter denoted G n —>i wc /i- In this setting we will describe general 
conditions under which the asymptotic free energy (j)((3,B) for the factor model (1.1) exists 
and agrees with the "Bethe energy prediction," which we now describe. (If the sequence of 
random graphs G n is such that G n -^i wc fi for almost every realization of the sequence, then 
our results apply instead to the a.s. limit of n' 1 log Z n (f3, B).) 

Let A%- denote the (|«f2T| — l)-dimensional probability simplex. Let T. + denote T. without 
the single- vertex tree T°, and let T c C Q.. be the space of isomorphism classes of trees 
T G T, + rooted at a directed edge x — > y, written (T, x — > y) or simply x — )■ y for short. If T 
has law \i for fi a unimodular measure on T., we let ^ and fj} denote the laws of (T, J — > o) 
and (T, o — > J), respectively, for J chosen uniformly at random from do conditioned on the 
event {T £ T, + }. Unimodularity of fi is then equivalent to 

E^[D x f(T,x^ y)\ = E^[D y f(T,x^ y)], 

so in particular ^ and ^ are equivalent measures. 

Definition 1.4. The message space is the space H, = of measurable functions 

h : T e x R 2 -+ A*, {(T,x^ y),P,B) h- {h^ y (a)U^, 
taken up to ^-equivalence. 

Remark 1.5. For (T,x — > y) £ T e let T x ^ y denote the component sub-tree rooted at x 
which results from deleting edge (x, y) from T. The interpretation of h x ^ y is that it is a 
message from x to y on the tree T, giving the distribution of o~ x for the factor model (1.1) 
on T x ^ y . Indeed, although we do not require it in general, in our concrete examples h x ^ y 
depends only on this component sub-tree. 
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For T G T. and heH, let 

$ T (/3, B, h) = $-(/?, 5, h) - $f,(/3, B, h) = 5, /i) - i ^ 4 oj) (/3, 5, /i) (1-6) 



2 

jedo 



where 




fl, h) = log J V^(<r) TT r^(^ 3 )^( ffj ) L (1.7) 



( \ }[ ° j \(3, B, h) ee log < ^Y(rx.rx ; )// ; ..(a,)//,, k J>. f L> 



We take the usual convention that the empty sum is zero and the empty product is one, so 
$t = l°g(X](T V ; ( <J )) in case T = T°- Although we suppress it from the notation, in the above 
equations and h are taken to be evaluated at (/3, B). The Bethe free energy functional on 
H for the factor model (1.1) on G n -^i wc \i is defined by 

^ fl (^B,h) = E fl [^ T (P,B,h)}, (1.9) 

provided the integral is well-defined (see Lem. 2.2). The Bethe prediction is that the asymp- 
totic free energy cj)((3,B) of (1.2) exists and equals 

$J ethe (A B) = B, h*) (1.10) 

where h* G H is a certain fixed point of the belief propagation (Bethe) recursion (1.15) 
below. We often drop the subscript \x when it is clear from context. 

Remark 1.6. In the case that the recursion (1.15) has multiple fixed points, the Bethe 
prediction is defined to be the supremum of $(/3, B, h) over admissible fixed points h. While 
in the abstract factor model setting all fixed points are admissible, in specific models typically 
there are "natural" criteria restricting the set of admissible fixed points. We will demonstrate 
this in the Ising and Potts models where restrictions are imposed by monotonicity and 
symmetry considerations. 

The rationale for the Bethe prediction is explained in detail in [8, §3]. In brief, when G 
is a finite tree and \Lq is the law of (G, I) for / a uniform element of V (fie is a measure 
on T.i but not necessarily unimodular), the equations (1.15) have a unique solution, given 
by the so-called "standard message set" (see [8, Rmk. 3.5]). In this setting it holds exactly 
that \V\- l \ogZ G (l3,B) = $^,(0,5) [8, Propn. 3.7]. The heuristic then is that for G n 
locally tree-like, the (normalized) free energy <p n is approximated by a similar expression for 
n large. We emphasize that no averaging over the vertices of the tree T takes place in the 
definition of $t: instead, the averaging of 3>(g,v) over t ne vertices v G G in the evaluation 
of $ MG (/3, B) corresponds to the averaging with respect to the law fj, in the evaluation of the 
Bethe prediction $^(/3,S). Indeed, for non-amenable T G T. the sub-trees T* typically do 
not converge locally weakly to T (see e.g. [8, Lem. 2.8]). 

The following is a terminology which we adopt throughout the paper: 
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Definition 1.7. If G is any graph and U a sub-graph, the external boundary dU of U is 
the set of vertices of G\U adjacent to U . Let U + denote the sub-graph of G induced by the 
vertices in Vu U dU. For U finite (so U + is finite, since G is locally finite), and A a measure 
on J%~ 9U , the factor model on U with A boundary conditions is the probability measure on 
configurations a_u G 3£ Vv given by 

(ii)eB C7+ i6L/ 

(Throughout, = indicates equivalence up to a positive normalizing constant.) The case in 
which A gives probability one to the identically-cr spin configuration on dU (cr G is 
referred to as o"o boundary conditions and denoted z/* = z/ 70 , while the case in which z/* is 
uniform measure on ^ du is referred to as free boundary conditions and denoted A = A. 

1.2. Application to Ising, Potts, and independent set. Before formally stating our 
main theorem for general factor models, we mention its consequences in some models of 
interest: we verify the Bethe prediction for the ferromagnetic Ising model at all temperatures, 
the ferromagnetic Potts model with field B > in uniqueness regimes, and the independent 
set model with low fugacity A. 

1.2.1. Ising model. The Ising model is the Potts model (1.3) with q = 2. For convenience we 
use the equivalent formulation which takes = {±1} and defines the probability measure 
on % v 

4 ,B 0i) = YWrn exp I ^ E + 5 X> [■ ( L12 ) 

For T G T. let Jij^ = h t rf h,l3,B denote the root marginal for the Ising model of parameters 
(/3, B) on T* with + boundary conditions (i.e., with a v conditioned to be +1 for all v at level 
t + 1), and similarly define h T corresponding to free boundary conditions. For f G {f, +} let 
hrp = JiTp B = li m t-s>oo h>T B • (Existence of the limits h { T , for the Ising model is an easy 
consequence of Griffiths's inequality; see Lem. 4.1.) We then define messages h$ G by 

A — A 

for T x _> y as defined in Rmk. 1.5. For G n — >i wc /x, the Bethe free energy prediction for the 
Ising model with /3 > 0, B > is 4>(/3, B) = $^(/3, B, h + ). This prediction was verified in [9, 
Thm. 2.4] for uniformly sparse graph sequences converging locally weakly to Galton- Watson 
trees subject to the second-moment condition E^[.D^] < oo, which was relaxed in [11] to a 
(1 + e)-moment condition. We have the following generalization of this result to an arbitrary 
limiting law. 

Theorem 1.8. For the Ising model (1.12) on G n -^-i wc \i, 

( p(f3,B) = ^^f3,B,h i ) = ^>^f3,B,h + ) 
for (3 > 0, B > 0. Also 0(/3, B) = 0(/3, -B) and 0(/3, 0) = lim^o <K/3, B) . 
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1.2.2. Potts model. Throughout the remainder let (Pq,B ) < (Pi,Bi), where < means 
coordinate-wise less than or equal to. An interpolation path is a piecewise linear path, with 
each piece parallel to a coordinate axis, increasing from ((3q,Bq) to (/3i,Bi) with respect to 
the partial order <. 

We restrict our attention to the Potts model with ft, B > 0. In this regime we are able 
to use a random-cluster representation to extract important monotonicity properties. For 
T G T. and $ G {f, 1} let = B denote the root marginal for the Potts model 
on T* with f boundary conditions. Let h\ = h^, 13,3 = lim^oo li^^ ,B (existence of the 
limits h T , h\< for the Potts model follows from monotonicity properties of the random-cluster 
representation; see Cor. 4.4). We then define messages h} G by h\^ y = h\,^, and let 

7Z^ = {(p,B) : /i f = h\ //-a.s.}. 

We also define 

^ = ({0} x M> ) U (M> x {oo}) U ({oo} x E >0 ). 

Theorem 1.9. For the Potts model (1.3) with q > 2 and (3,B > on G n — >i wc fi, the 
following hold (with $ = $ M; 7Z = IZ^): 

(a) If there exists an interpolation path contained in 7Z joining ((3,B) and TZ^, then 

( p(f3,B) = ^(f3,B,h f ) = ^,B,h 1 ). 

(b) If there exists an interpolation path from (/3 , -Bo) to (Pi, Bi) along which h { is continuous 
(in the interpolation parameter) , then 

liminf [(j) n (Pi, Bx) - </> n (p ,B )] > ^ 1 ,B 1 ,h i ) - $(/3 , B , h'). 

n— »oo 

If h { is replaced with h l then we have instead 

limsup [0 n (/?i, Bi) - <j} n (Po, B )] <$(/3i, B 1 , h 1 ) - $(/3 , B , h 1 ). 

n— >oo 

We obtain more explicit results when the limiting tree is the <i-regular tree T^. 

Theorem 1.10. For the Potts model (1.3) with q > 2 and f3,B > on G n -^i wc the 
following hold (with $ = $ Td , 1Z = 7Z Td , and = {(3,B > 0}\1Z): 

(a) If d = 2, 7Z^ = 0. If d > 2 and q = 2, there exists < /?_ < oo such that TZ^ = {B = 
0,P > /?-}. If d > 2 and q > 2, there are smooth curves /3f(B) and P+(B) defined on 
[0,B + ] with Pt(B+) = P+(B + ) such that 

TZ^ = {B = 0,(3> ft(0)} U {0 < B < B + , (3 G [P { (B), P+(B))}. 

(b) For (P,B) <£ 1Z + , <j>(P,B) = $(/3,5,/i f ) = $(/3, B, h l ). If (f3,B) G dTZ^ with p = 
p f (B), then <f>{p,B) = $(/3, B, h f ). If (p,B) G with > p + (B), then (j)(p,B) = 
<&(P,B,h>). For (p,B) in the interior 1Z° + oflZ + , 

max{^(p,B,h 1 ),^(p,B,h i )} < liminf <f> n (p,B) 

< lim sup B) < mm{^ i (P,B),^ 1 (P,B)} 

n—>oo 
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where 

B) = 5, /i f ) + [$(/?, B, /i 1 ) - B, fc 1 )], 

$ f (/3, B) ee $(/3+(B), 5, /i 1 ) - [$(/3 + (B), B, h { ) - $(/?, B, tf)]- 





Figure 2. Potts Bethe fixed points and the intermediate regime TZ-t 

Figs. 1-3 highlight the difficulty in analyzing the Potts model (q > 2) as opposed to 
the Ising model. Fig. la shows the Ising Bethe recursion parametrized in terms of the log- 
likelihood ratio r ee log h(+) — log h(—). For sufficiently large /3 the recursion has three fixed 
points, but in this case the r = fixed point is unstable, and we will see in the proof of 
Thm. 1.8 that adding a small magnetic field resolves the non- uniqueness. The remaining 
plots were computed for the Potts model with q = 30 and d — 4. Fig. lb shows the Potts 
Bethe recursion at B = restricted to those h which are symmetric among the spins ^ 1, 
and parametrized by r ee log h(l) — log h(2). The fixed point at r = corresponds to /i f while 
the uppermost fixed point corresponds to h 1 ; Fig. 2a shows how the fixed points vary with f3. 
In an intermediate regime of /3-values (shaded in Fig. 2a) both fixed points are stable, and 
perturbing by a magnetic field does not resolve the non-uniqueness: indeed, Fig. 2b shows 
that there is a two-dimensional region TZ-l of (/3,B) values for which h l ^ h 1 , making the 
exact Bethe prediction inaccessible via our current interpolation scheme. Fig. 3 shows the 
discrepancy between the upper and lower bounds of Thm. 1.10 (b) inside 1Z^. 
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Figure 3. Potts Bethe interpolation: the heavy (light) shaded regions are the asymptotic 
lower (upper) bounds on <p n given by Thm. 1.10; the bounds fail to match when B) € IZ^. 
The Bethe prediction is the upper envelope of the thick lines. 

In the figure, a shaded region marked "lbd /iT" (resp. "ubd fv") means an asymptotic 
bound on <j) n obtained from interpolation using the asymptotic lower (resp. upper) bound 
on dn(f3,B) by a e (/3, .B, /iT), in the notation of Thm. 1.13. For example, the shaded region 
labeled "ubd h 1 " is an asymptotic (lower) bound on <f> n obtained by interpolating from 
/3 = oo using the asymptotic upper bound limsupa° (0, B) < a c (/3, B, h) 1 . 

1.2.3. Independent set model. We consider the independent set model (1.4) in the regime of 
low fugacity. For f 6 {0, 1} let Ji^ = h^' denote the root marginal on T* with f boundary 
conditions on <9T* (i.e. at level t + 1). Let h\ = lim^oo (existence of the limits h?, hr 

for the independent set model follows from anti-monotonicity; see §2.4). We then define 
messages h$ G by h\^ y = h\,^, and let 

A c = \ c , u = inf{A > : fJ(h°£ y = h l £ y ) < 1} 

denote the uniqueness threshold. For T e T m , we use brT to denote the branching number 
of T (see [30, §2]). 

Theorem 1.11. Consider the independent set model (1.4) on G n — >i wc fi, and write A c = 
A c ,/x- 

(a) If A < A c and h has total variation bounded by a deterministic constant on [0, log A], then 

0(A) = $ M (A,/i°) = $ M (A,/ i 1 ), (1.13) 

which converges to 0(A C ) as A t A c . 

(b) IfbiT x ^ y < A — 1 jv -a.s. for A a deterministic constant, then (1.13) holds for A < A c 
with A(A - 2) < 1. 

(c) If n = 5j d , then (1.13) holds for A < A c . 
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For the ci-regular tree T d , the uniqueness threshold X c (d) is (d — /(d — 2) d (see [27, 
§2]), and [42, Thm. 2.3] shows that has the lowest value of A c among trees with maximum 
degree at most d. The identity (1.13) has been proved in the case that the G n are random 
(i-regular graphs [2, 3]. It is also suggested by Weitz's ptas for Zg(X) on a finite graph G 
of maximum degree A and with A < A C (A) [42, Cor. 2.8]. For /i a unimodular measure 
on T. giving a local tree approximation to G (in the sense of Defn. 1.1), A CjAt is often an 
improvement over A C (A), making it possible to compute 0(A) above A C (A) provided (H3' B ) 
can be verified. In [39] the interpolation scheme of Thm. 1.11 is refined to give a verification 
of the Bethe prediction on locally tree-like (i-regular bipartite graphs for all A > 0; this result 
is then leveraged to show inapproximability of the hard-core partition function on d-regular 
graphs above X c {d). 

1.3. Results for general factor models. We now state our results for the factor model 
(1.1). With the convention logO = — oo, let logtp = £ and logifi = £, and impose the 
regularity condition 

(HI) The specification is permissive, that is, ip(cr) > for all o G and there exists a 
"permitted state" a v G 3£ such that min^ ip(cr, o~ v ) > 0. 

For any o G £ B {cr) is continuously differentiable in B. For any a, a' G JT, 
£^(cr, a') is either identically — oo over all /3, or finite and continuously differentiable 
in (3. 

Recall Defn. 1.4 of the message space %. The belief propagation or Bethe recursion is the 
mapping BP = BP /3 ' B :H^H, 

(BPP> B h) x ^ y (a) = 1 J B (a) J] (1-14) 

with z x ^ y ((3, B) normalizing constants. For /i a measure on T, and fixed ((3, B), let "H^(/3, B) 
denote the space of measurable functions h : % — > A,^, again taken up to //^-equivalence, 
which are fixed points of the Bethe recursion: that is, satisfying 

h=BpP> B h, //-a.s. (1.15) 

For h G Hfj, we can define h : T. — > Ax up to /i-equivalence by 

h T (a) = 4,(a) J] (^V(^^)^o(^)). (1.16) 

j&do CTj 

In particular, if h G "H^(/3, B) and T G T. + , then clearly we have 

h T {o-) = y^-0((7, a^ho^jirfhj^oio-j) (1.17) 

CT 3 

independently of the choice of j G do. From now on, for h G "H M , we will write h G H* to 
indicate that h 13 ' 3 G "H^(/3, -B) for (/3, 1?) in the range being considered. 

Remark 1.12. The elements of W are consistent with the recursion structure of the tree in 
the following precise sense: for T G T. and U a finite connected sub-graph of T, consider the 
factor model i/J T on !7 with boundary conditions a v ~ h v ^ p ^ independently for v G dU, 
where p(v ) denotes the (necessarily unique) neighbor of v inside U . Then the marginal of 
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v T t T on T* 1 is exactly the factor model z/^ t p _'\ T on T* 1 with boundary conditions o u ~ 
(BP/i) M ^ p ( u ) independently for w G dT 1 ^ 1 , including any u which are leaves of T*. This 
statement remains valid if dT t or even <9T* _1 is empty, since if <9T* = then ^ t r is simply 
u T as defined by (1.1). Continuing the recursion up the tree, we see that h G W implies that 
the marginal law of a Q will be hr as defined by (1.16). From this it is easy to see that the 
measures Vy T form a consistent family of finite-dimensional marginals, so by the Kolmogorov 
consistency theorem they uniquely determine a probability measure = v t belonging to 
^Ti the set of Gibbs measures (or Markov random fields) associated to the specification 
t/j = on T. 1 (In fact this mapping is one-to-one, e.g. by Rmk. 2.3 below.) Each 

belongs to a special class of measures in ^ which are called Markov chains or splitting 
Gibbs measures in the literature, and the entire collection {vt)t&t. arising from h G %* has 
a consistency property which leads us to term them "unimodular Markov chains" or "Bethe 
Gibbs measures" (see §2.3). 

In this general setting, the Bethe prediction is the supremum of B, h) over H*(/3, B) 
(cf. Rmk. 1.6). (It will be shown in Lem. 2.2 that $ M is uniformly bounded on H^((3,B) 
subject to E^fZ^] < oo; if further ip > then $^ is in fact uniformly bounded on "H 
subject only to E^DJ < oo.) We define the following integrability condition for unimodular 
measures fi on T. (not necessarily arising from a graph sequence): 

(H2) The probability measure [ionT. satisfies E M [D ] < oo. If -0 is n °t everywhere positive, 
then furthermore E^fe 013 "] < oo for all c G M. 

Note that if G n -^i wc fi and ip > then (H2) holds trivially by the assumption of uniform 
sparsity. We will in fact justify our interpolation scheme under a weaker assumption than 
(H2); for the exact condition see (H2^), (H2 B ) in §2.2. 

1.3.1. Bethe interpolation. We will deduce the results of §1.2 from the abstract interpolation 
method given by Thm. 1.13 below, which bounds differences of <ft((3,B) by differences of 
$(/3, B, h) (h G H*) when the limiting expectation of a certain edge or vertex functional in 
the finite graph (capturing respectively d^ n or <9s0 n ) is bounded by the expectation of an 
analogous functional on the infinite tree. 

To be more precise, recall that I n denotes a uniformly random vertex of V n . Let (•)„ 
denote expectation with respect to vg„,& conditioned on G n . For h G %^(/3, B) and T G T., 
let []] r denote expectation with respect to v T (as defined in Rmk. 1.12), conditioned on 
T, and define 



aH^B) = ±E n [ £ (dptiWi)**], a*(P,B,h) ee ^E, [ £ \d£{a , a^l 



\ h /> IJ 

j&dl„ j&do 



a 



vx 



%B) = M n [(d B ^a In )) p f}, a™(/3,B,h) = E J[«9 B £> )]^' B> 



The left-hand expressions are the derivatives d/3<j) n , dB<f) n (Lem. 2.1). The right-hand expres- 
sions are the infinite-tree analogues, which we will show in Propn. 2.4 may be thought of as 
derivatives in /3 and B of 



Strictly speaking the term "Gibbs measures" refers to the case ip > 0, but we will follow common practice 
and say Gibbs measures also for the general case. For the general theory of Gibbs measures see e.g. [19]. 
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For interpolation in (3 on a compact interval [f3 , (3i\ using h G "H*, we require the following 
regularity condition: 

(H3 13 ) On [/3 ,/3i], /i^-a.s. the functions /3 hj_> (a) are continuous with total variation 
bounded by a deterministic constant depending only on /3 , Pi- 

We define the analogous condition (H3 B ) for interpolation in S on a compact interval 
[Bq,Bi\. The condition of boundedness in total variation is implied for example whenever 
the functions h are (anti-)monotone in the interpolation parameter. 

Theorem 1.13. Let z/> = (Vs V>) specify a factor model (1.1) on G n -^i wc \i such that (HI) 
and (H2) are satisfied. 

(a) If on [/3o, A] we have h G W satisfying (H3^) ; and 

limsup<(/3,S) < a c (/3,S,/i), (1.18) 

then limsup^J^A, S) - 0„(A>, 5)] < S, ft) - $(/3 , S, h). 

(b) If on [B ,Bi] we have h G H* satisfying (R3 B ), and 

limsup< x (/3 ;J B) < a vx (/3,S,/i), (1.19) 

tfien Umsup^J^&Si) -</>„(/?, S )] < $(/3, S x , /i) - $(/?, S ,/i). 
T/ie same results hold if all inequalities are reversed, replacing limit superior with inferior. 

The conditions (1.18), (1.19) (and their reverses) are automatically verified in the following 
special case: 

Theorem 1.14. Let ^ = (ip,ip) specify a factor model (1.1) on G n -^i wc fi satisfying (HI) 
and (H2). We say that uniqueness holds if ctt ((3,B) consists of a single measure v T , 
[i-a.s. In this case, %*(/?, S) is a singleton. 

(a) If on [A),/3i] x {B} uniqueness holds and the unique element h G H* satisfies (H3^), 
then 

lim [0 n (ft, B) - 0„(/3 o , B)} = $(/3 x , B, h) - $(/3 , B, h). 

n— >oo 

// on {/?} x [Sq, Si] uniqueness holds and the unique element h G H* satisfies (H3 B ), 
then 

lim [0 n (/3, 50 - n (/3, So)] = $(/?, Si, h) - $(/?, S , /i). 

Uniqueness for £fy corresponds to the vanishing effect of boundary conditions on <9T* as 
t — > oo [19, Ch. 7]. Dobrushin's uniqueness theorem (see e.g. [37]) gives a sufficient condition 
for uniqueness to hold, together with a bound on the rate of convergence of the root marginal 
in T* to the limit as t — > oo. Note that if the convergence rate is uniform in /3,S then the 
continuity required in (H3^) and (H3 B ) immediately follows. We will obtain continuity in 
uniqueness regimes via a different route, making use of certain monotonicity properties (see 
the proof of Thm. 1.8). 
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1.3.2. Variational principle. We further develop the theory by providing a variational prin- 
ciple for the Bethe prediction: we express $^(/3, B) as an optimum of a function $ At (/3, B, h) 
defined for h in a larger space "Hi oc which, unlike "H^(/3, B), is independent of 0, B. This al- 
ternative characterization of $^ is the infinite-tree analogue of the finite-graph optimization 
problem that is considered in [44]. Recall from §1.1 that % denotes the space of trees rooted 
at a directed edge. 

Definition 1.15. The local polytope T-L\ oc = T-Li oc ,fi is the space of measurable functions 

h : % -> A<r2, (T, x ->■ y) h(r, x -+ y ) = h xy , 
taken up to /^-equivalence, such that 

(i) h xy (a,cr') = h yx (a',a) for all a, a' G JT, and 

(ii) for T G 7^ + , the one-point marginal h x (a) = h(r,x)(°') = J2a y h X y(cr,a y ) is well-defined 
independently of the choice of y G <9x. 

We also define 

"Hkx#] = G "Hioc : //(supp/i C supp^) = 1}, 
?CcM = G "Hioc : //(supp/i = suppV>) = 1}. 
In accordance with (1.16), we set 

h T (a) S ^(<r) if T = T°. (1.20) 

For fixed ((3, B), by symmetry of ipP and (1.17), the space H*(/3, -B) has a natural mapping 
into %\ QC given by 

hh^-h, h xy (a,a') = ip(a,a')h x ^y{a)h y ^ x (a'). (1.21) 

With ^ permissive this is in fact an embedding; see Rmk. 2.3. We define the Bethe free 
energy functional on l-L\ oc by 

%{h) = E M [ (aa ))- ho - (D - l)H(h ) + M°o, °j)) hoj + H(h oj ) }] , (1.22) 

jedo 

where H(p) denotes the Shannon entropy — ~^2 k Pk\ogPk for p a probability measure on a 
finite space. This is an infinite-tree analogue of the definition of [44, (37)-(38)] for finite 
graphs. With the usual conventions logO = — oo, OlogO = and 01og(0/0) = 0, $ M 
is bounded above on T-L\ oc whenever E M [£) ] < oo, and we show in Lem. 3.1 that for /i 
unimodular, this $ M extends the previous definition (1.9) on W (under the embedding 
(1.21)), provided the latter is finite. Furthermore, writing H(q || p) for the relative entropy 
qk \og(qk/pk) between q and p (well-defined for any non-negative reference measure p), 
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for jj, unimodular we can alternatively express 

= -Ep[H(h || n -\^[Y1 H ( h °i II ^)] " E m [A>#(/0] (1-23) 



jedo 



= -Efj,[H(h || $)] - ^E M [ ^ {if (/i oj || V) + H{h ) + tffo 

= -E a [H(h || $)] - [I] H ( h <d \\h0X4, hj)] > (!-24) 

where (/i x^ hj)(a ,aj) = h (a )ip(a , aj)hj(aj), and unimodularity is used in the second 
identity. 

This extended definition of Q u provides the following variational principle for the Bethe 
free energy: 

Theorem 1.16. Let = (ip,^) specify a factor model (1.1) satisfying (HI), and let fi be a 
unimodular measure on T. with E, U [D ] < 00. 

(a) $n(/3,B) = sup hen]oc $^((3, B,h) is continuous in (f3,B). 

(b) Any local maximizer of & n ((3, B) belongs to Hi oc [4>}- Any stationary point of & u ((3, B) 
belonging to T-L° oc [ip] is the image under (1.21) of an element of %*({3 , B) . In particular, 
if $ M attains its supremum on H.\ QC , then 

B) = max $ M (/3, B, h) = ^ ethe (f3, B), 

so that the Bethe free energy is also continuous in (/3,B). 

Although we do not pursue this point, we mention that even in specific models where the 
abstract definition of $ Bethe is supplanted by $(/3, B,h) for some "naturally" distinguished 
h, an adaptation of Thm. 1.16 (involving a restricted subspace of "Hi oc which is independent 
of (f3, B)), may be relevant. 

Remark 1.17. In case G n -^i wc T d the <i-regular tree, T-L\ oc is parametrized by a single 
measure h xy on i?f 2 whose one-point marginals are required to agree, and the formula (1-24) 
simplifies to 

- = H(h || $) + ^H(h 01 || h x^ h x ). (1.25) 

For a e let = ^ev n ^ and L e n = (2\E n \y l E(i 3 <)&d S (w) + <W*)1 denote 

the induced empirical and pair empirical measures respectively. If G n is ci-regular, then the 
one-point marginals of L n coincide with L™, and 

I r r fid ~1 " 

n = log|Jf| + -E n ^fogE Sn exp |n(^) L vx + j 

where the law of a is the uniform measure u n on and E Sn denotes expectation with 

respect to u n (with G n fixed). 

If (G n ) is an independent sequence of uniformly random d-regular graphs and g_ n ~ u n , one 
might guess that for a.e. (G n ) the induced sequence L n satisfies a large deviation principle 
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(ldp) with good rate function 




(1.26) 



where u = u\. If this were the case, it would be an immediate consequence of Varadhan's 
lemma (see [10, §4.3.1]) that <fi n — > §^((3,13) (as defined in Thm. 1.16) for any factor model 
satisfying (HI). However, for many of these models the Bethe prediction is known to fail 
at low temperature for d > 3. So, while Thm. 1.16 suggests a potential connection to 
large deviations theory, such a connection would be highly non-trivial and applicable only 
in certain regimes of (/3, B). 

One special case in which everything trivializes is the (rooted) infinite line T2, the local 
weak limit of the simple path G n on n vertices. In this case u n may be viewed as the law of 
a stationary reversible Markov chain on S£ with transitions q(a, a') = u(a') and reversing 
measure u, and it is well-known (see e.g. [10, Thm. 3.1.13]) that the associated pair empirical 
measure L e n satisfies an LDP with good rate function I(h i) = H(h i(a,a') \\ h (a)q(a,a')) 
which matches (1.26). The implication of Varadhan's lemma is also easy to see: a factor 
model on the simple path G n with general positive specification ^ corresponds in the limit 
n — > 00 to a reversible Markov chain with transition kernel p and positive reversing measure 
7r given by 



where p and m are the Perron- Frobenius eigenvalue and eigenvector of the symmetric positive 
-dimensional matrix with entries tp(cr,a') = ^(ff, ff')t/|(^^) 1 '' 2 1/;((r') 1, ' 2 . The Bethe free 
energy functional (1.25) is then maximized at hoi(a,a') = ip(a,a')m(a)m(a ; )/p, where it 
takes the value $ M (fo) = logp which coincides with (p by the Perron-Frobenius theorem (see 
e.g. [10, Thm. 3.1.1]). 

Outline of the paper. 

• In §2 we prove the abstract interpolation results. §2.1 presents some preliminary 
lemmas which will be useful in our proofs. Our main result for abstract factor mod- 
els, Thm. 1.13, is proved in §2.2. §2.3 contains the specialization of this theorem to 
the uniqueness case (Thm. 1.14) and also contains discussion on unimodular Markov 
chains (or Bethe Gibbs measures). §2.4 shows how to deduce our result for indepen- 
dent set (Thm. 1.11) from Thm. 1.13. 

• In §3 we prove the variational characterization Thm. 1.16 for the Bethe free energy 
prediction, establishing in particular the correspondence between interior stationary 
points h G H° oc [^] of $^ and fixed points h G of the Bethe recursion. We 
further provide in Propn. 3.4 a simple criterion for such stationary points to be local 
maximizers. 

• §4 contains applications of our abstract results to the Ising and Potts models. In 
§4.1 we prove Thm. 1.8, generalizing the results of [9, 11]. In §4.2 we prove Thm. 1.9 
by appealing to a random-cluster representation. Finally §4.3 analyzes the <i-regular 
case and proves Thm. 1.10. 



p(a, a') 




7r(cr) = m(cx) 2 
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2. Bethe interpolation for general factor models 

2.1. Preliminaries. We begin with some straightforward observations on the boundedness 
of the free energy <fi n and the Bethe free energy $ M as defined on ~H, and we prove that the 
mapping (1.21) of W into "Hi oc is in fact an embedding for permissive specifications. 

Lemma 2.1. For the factor model (1.1) satisfying (HI) onG n —±i wc \i, the functions <fi n ((3 , B) 
are uniformly bounded and equicontinuous on compact regions of (/3,B), with 

d p cfi n {(3,B) = -E n [dp\ogZ n (P,B)\, d B cfi n {(3,B) = -E n [d B \ogZ n (P,B)\. (2.1) 
Further 

U p \ogZ n (^B) = I £ (^K,^)>r, -d B \ogZ n {P,B) = {d B l(a In )Y;\ 
n jedi n U 

with the convention dp£(a,a') = in case £(cr, er') = — oo. 

Proof. The expressions for n~ 1 dp log Z n (f3, B) and n -1 ^ log Z n ({3, B) are obtained by a 
straightforward computation. Now note that if G n -^i wc \i then the uniform sparsity as- 
sumption gives 

^E n [\E n \] = ^E n [D In ] h 

Let (/3, B) vary within a given compact region. By (HI) we have £, £ < £ max as well as 
^(o rP , '),£> Cmin- Therefore 

(1 + \E n \/n)Un < n- 1 logZ n (P, B) < log \X \ + (1 + |£„|/n)£ max 

so (fin = n _1 E n [log Z n (p, B)] is uniformly bounded by uniform sparsity. The exchange of 
differentiation and integration in (2.1) is justified by Vitali's convergence theorem, in view of 
the boundedness of dp£, d B ^ and the uniform integrability of \E n \/n. It follows furthermore 
that dp(fi n ((3,B) and d B <fi n ((3, B) are bounded uniformly in n, from which equicontinuity 
follows. □ 

Lemma 2.2. Let ^ = specify a factor model (1.1) satisfying (HI), and let fi be a 

unimodular measure on T.. For any compact region of (/3,B) there exists a deterministic 
constant C < oo such that 

(a) \$ T (p,B,h)\ < C[pl + 1) for any h G W, and 

(b) if further ip > 0, then |$ T (/3, B,h)\< C(D + 1) for any heU. 

Hence, on any compact region of ((3, B), $^ is uniformly bounded on H* provided E a [D^\ < 
oo, and, ififi>0, uniformly bounded on "H M subject only to E U [D ] < oo. 

Proof. Let £ m in>£max be as in the proof of Lem. 2.1. Then, for any h 6 H, 

log \X | + (D + l)£ max > $ v T x (h) > (D + l)£ min . 



: E n [\E n \] = -E n [D In ] ->• -E M [D ] < oo. (2.2) 
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If ip > then we also have 

min) 

so |$t(/3, B,h)\ < C(D +1) on H, which proves (b). For general permissive ■?/>, the preceding 
lower bound on may fail, but (1.15) implies that for h G "H*, 

log V^V) > D (U m ~ Cmax) - log|iT|, Vj G do. (2.3) 

Therefore, 

£>ofmax > 2®T(h) > (U + log /i ^ ((X P )) > D„(U - log \S£\) + D„ 2 (U - £ max ), 

jedo 

which proves (a). □ 

Remark 2.3. It is now easy to see that the mapping (1.21) of %*(/3, B) into T-L\ oc is injective: 
if h,h' E*H give rise to the same h, then 

for z XtV a positive scaling factor. If h, h' G "H*(/3,I?), then (2.3) implies that //1-a.s. both 
h y ^ x and /i^j. give positive measure to cr p . Therefore, //1-a.s. the -dimensional vectors 
/i^-^ and h' x _yy are equivalent up to scaling, and since both are probability measures on i?T, 
we must have h = h! /i^-a.s. as claimed. 

2.2. Bethe interpolation. We now prove Thm. 1.13 (a). The result is for fixed B so we 
suppress it from the notation. The proof of Thm. 1.13 (b) is very similar and will be given 
in brief at the end of this section. 

Our interpolation procedure relies on the proposition below which expresses <3> M as the 
integral of its partial derivative with respect to (3 only, ignoring the dependence on /3 through 
the function h. Recall that although it is suppressed from the notation, and h depend on 
(3, and are taken to be evaluated at (3 in expressions such as &t{(3,B). We will prove our 
result under the following integrability condition, which by (2.3) is a relaxation of (H2): 

(H2 /3 ) The probability measure fionT. satisfies E^-DJ < oo. If ip is not everywhere positive, 
then furthermore ^ 

2_, SU P ~r < °°- 

We define the analogous condition (H2 S ) on an interval [Bq,Bi\. 

Proposition 2.4. Let ^ = (VsVO be a specification satisfying (HI), and // a unimodular 
measure on T.. If on [A),/3i] we have h G W satisfying (H2 /3 ) and (H3^) ; then 

$(P 1 ,h)-$(P ,h)= / a e (^h) d^ 

ho 

Proof. For fixed T G T. we shall regard $r simply as a function of a vector (/3, /iz-^x-n/eT 1 in 
(1 + 2|«^T|D )-dimensional euclidean space (with h depending on (3). We begin by computing 
the partial derivatives of this function with respect to (3 and h. We abbreviate h^j(a) = 



20 A. DEMBO, A. MONTANARI, AND N. SUN 

(BP^h) _^j(a) for the belief propagation mapping of (1.14), which for fixed T and each j G do 
is a well-defined function on the same euclidean space as $y Making use of (HI) we find 

If ft G "H* then ft^ = ft, therefore (recalling the notation from §1.3.1) we re-express the 
above as 

jedo 



and combining gives 



CM) = \ Y,l d ^ a o,CT j )t/ = a e T (P,h). (2.6) 



dp " ' ' 2 
Likewise we compute that for T G 7^ + , 

d$F(j3,h) n <9$^(/3,ft) 7 \ EX^^l^W 



dhj^. (aj) 2 J dft ^(cr) 2 

where is the same as but with ft, in place of ft. Note that for permissive ip and any 

(TGf, 



— > y ft) < < <a X (2 7) 

7 ' E CT '^> P )W^ P )^(<) V&nW* p ) 



If further t/> > everywhere, then g^(x —ty]h)< ^max/Vmin i s uniformly bounded on [/3 , Pi]. 

Consider now a small sub-interval [(3, (3 + S] of [/3o,/3i]. Writing A^ft = ft^ 4 " 5 — ft' 3 and 
applying the mean value theorem to the different iable function t h- >■ + t<5, ft + tAp^h) 

for t G [0, 1] gives 

$ T (/3 + 5, ft) - $ T (/3, ft) = ^(/3 + tr5, ft^ + tA^ft^ + T T (/3, 5) + E T (/3, 5) (2.8) 
for some t = tpj G [0, 1], where 



dh x ^ y (a) ' dh x ^ v (a) 

a x—>y us/ a \ t 

and EI->y indicates the sum over the 2D Q directed edges x — > y within T 1 . 
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Setting 5 = 5 m = — (3 )/m, we now sum <E>(/3 + S m ,h) — $(/3, /i) over /3 £ II m = 
{/3o + fc5 m '■ < k < m} and analyze separately the contribution of each term on the 
right-hand side of (2.8): 

(a) First we show that E^[T T (f3,5)} = for any + 5} C [p , Pi]. Indeed, since h e H* 
we have /i^ = ftr and "g^ = g 13 . Therefore, 

The result then follows from unimodularity of /i, subject to /i-integrability of 

Clearly ^^^^^(o - )] < 2 so integrability certainly holds when ip > 0, since E M [£)J < oo 
and gP is deterministically uniformly bounded on [f3 , px] as noted above. More generally, 
for permissive ip the required /z-integrability follows from (2.7) and (H2^). 

(b) The total contribution of the first term on the right-hand side of (2.8) is 



r v CsPr a 1 

/3en m ' 

Observe that A m = fY m d(X x jj) where A is Lebesgue measure on [/3 ,/3i] and 



Y m (P',T) = J2 HP <P'<P + + 



,3en m 

For (A x /i)-a.e. (/?', T), this sum has at most one non-zero term, in which the argument 
of <9/3$t converges by (H3^) to {P' ,hP ) as m — > oo. From (HI), (1.14), and the compu- 
tation of dp^T in (2-4)-(2.5), we see that dp<&T(P,h) is continuous in (j3,h). Therefore, 
Y m (j3',T) — > a^(P',h), (A x /x)-a.e. Furthermore, (HI) implies that \d/3^\ < C uniformly 
on [flo, f3i] for some deterministic constant C, so \Y m \ < 2CD a for all m, (A x /i)-a.e. (see 
(2.4) and (2.5)). Dominated convergence then gives 

lim A m = [ a e T ((3',h) d{\ x //) = / * a e ((3',h) dp'. 

m ^°° J J p 

(c) The contribution of the final term in (2.8) is E^fE^j] where 

EiT,m = ^ E T (/3,5), 
/3en m 

and we conclude the proof by showing that lim n j,_*. 0O E (U [ET, m ] — 0. 

Indeed, it is not hard to see that \\Ta. rn ^ too ET, m — /i-a.s.: by the uniform bound on 
total variation assumed in (H3^), there exists deterministic C such that 



\ET,m\ < C max sup sup 

— ' " /3e[/J ,/3i]te[o,i] 



(P + tS m , + tA^ m h) - - \ r (P, h) 
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/i-a.s., uniformly in m. It also follows from (H3^) that /z-a.e. h" is uniformly continuous 
on [/3q, Using (HI), the partials dh^T computed above are uniformly continuous in 
(P, h) for (3 G [/3q, /3i] and /^ (cr p ) uniformly bounded away from zero. By (2.3) there 
exists deterministic c such that 

inf hP(a p ) > e - c( ^ +1) Vj G do, w-a.s.. 

pe[Po,pi] J ^ oK ~ 

Combining these observations gives liuLm—^oo Ef^m — /x-a.s. 

To take the limit in /i-expectation, we argue similarly as in part (a): by (2.7) and (HI) 
there exists deterministic C such that 



(P + 15, + tA M h) 



dh x -> y ((j 



a c 

< — < sup 



R — ' ' of 

h^ y (crP) + tA0 >5 h x ^.y(aP) P'e[p ,pi] K-+y(<jv) 
for all P G [po, Pi - 5], x -> y G T 1 , a G 3C and t G [0, 1], hence 

\E T , m \ < CC'2_\ sup — B — - — -. 

x ^ y PelPoM hZ^y((T p ) 

This is integrable by (H2") and unimodularity of fi, so dominated convergence implies 
that lmim^oo E M [-B^to] — as claimed. 
Combining (a)-(c) gives the result of the proposition. □ 

Proof of Thm. 1.13 (a). Recalling Lem. 2.1, 

rPi 

limsup[0 n (/3i) - (p n (Po)] = limsup / a c n (P,B) dp 

n— >oo n— >oo J Po 

rPi rPi 
< / hmsnp a e n (P,B) dp < / a c (P,h) dp, 

J Po n^oo J p Q 

where the first inequality follows by (the reversed) Fatou's lemma and the second one by the 
hypothesis (1.18). By Propn. 2.4 the right-most expression equals to $(/3i, h) — $(/3o, h), so 
the theorem is proved. □ 

The justification for interpolation in B is entirely similar: 

Proof of Thm. 1.13 (b). Now P is fixed so we suppress it from the notation. For h G H and 
T eT. + then 

d$ T (B,h) d^ x (B,h) Y J „,a 1 dBi{o-)^{a ) or j )h j ^ {o- j )h ^ j (o) 

= = - ^ V? G oo, 

dB dB VK^i) V^) W^) 

while if T = T° then d B <$> T = £ ff B f (*)$(*)/ E CT HheU* then h B = h B } so 

E„[d B $ T (B,h)] = E,ild B Z(o- )} h T B ] = a™(B,h). 

The result now follows by adapting the proofs of Propn. 2.4 and Thm. 1.13 (a). □ 
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2.3. Discussion and first consequences. We now prove Thm. 1.14 by considering an 
extended notion of local weak convergence. As discussed in [1], a graph G = (V, E) together 
with a spin configuration a G SC on the graph can be regarded as a graph with marks in 
S£ . Let Gf and Gf % denote the spaces of marked isomorphism classes of connected, rooted 
and bi-rooted graphs, respectively, with marks in S£ . These spaces are metrizable by the 
obvious generalizations of the metrics on Q.,Q mm defined in §2.1, giving rise to the notion 
of local weak convergence for pairs (G n ,a n ) of graphs with spin configurations. Defn. 1.2 
generalizes naturally to this setting, and we show next that if a n is a random configuration 
on G n with law vg k ,± ( as defined in (1.1)), then a local weak limit of (G n ,a n ), if it exists, 
must be unimodular. 

Lemma 2.5. IfG n -^i wc n and q_ n ~ va n ,±_ then the laws of(G n ,a n ) have subsequential local 
weak limits belonging to the space <% of unimodular measures on Qf . 

Proof. For each fixed t, the laws of B t (I n ) are weakly convergent, hence by Prohorov's 
theorem form a uniformly tight sequence. Consequently, for each e > there exists K e C Q, 
compact with sup n F n (B t (I n ) K e ) < e. Further, K e may be taken to contain only graphs 
of depth at most t, whereby the minimal distance between any two graphs in K e is uniformly 
bounded below (by l/(l+t)), hence the compactness of K e implies that it must be a finite set. 
The collection of all marked graphs in Gf whose underlying graph is in K t must therefore be 
finite, hence compact as well. Thus, by yet another application of Prohorov's theorem, the 
joint laws of (B t (I n ) , a Bt u\) are uniformly tight in Gf and consequently have subsequential 
weak limits. By extracting successive subsequences for increasing t and taking the diagonal 
subsequence it follows that the sequence (G n ,a n ) admits subsequential local weak limits 

fieW. □ 

For /i6 f , the marginal // of ju is a unimodular measure on Q.. If it is supported on a 
single tree T as in the <i-regular case, then clearly // may be represented as St x v where 
v G ^Ti the space of Gibbs measures on T corresponding to specification To make such 
a statement in the general setting, note that there is a continuous mapping 7r from Q, to 
the space M. of graphs on Z> rooted at 0, taking an isomorphism class to its canonical 
representative [1, p. 1461]. Thus ju may be regarded as a measure on the product space 
Af. x «^2f z ^°, and consequently fi, has a representation as the measure \i ® v on pairs (T,a) 
where T has law [i and a given T has law vt G 'St- I n particular, if \&t\ — 1 /i-a.s., then 
fx ® v is uniquely determined. 

Let /ibea unimodular measure on T.. It was noted in Rmk. 1.12 that there is a mapping 
from 'H*(^,B) to collections [y T G Wt)tgT.- For such u, fj, ® v belongs to % : if / is a 
non-negative Borel function on Gf^ , it follows from the 7^-measurability of elements of T-L\ oc 
that 

[ ^(( T ' £)> °> ■?')] = E 4 °> ft] ' 

j&do j&do 

where / is a non- negative Borel function on Q... The unimodularity of the underlying 
measure \i then gives 

E^ v ^2f((T,a),o,j) = E^ u [^2 f((T,a),j,o) 

j&do j&do 
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and therefore \x ® v G % . 

Remark 2.6. An element v G 5fx is called a Markov chain (or splitting Gibbs measure) if 
for any finite connected sub-graph U C.T, the marginal of ^ on {7 is a Markov random field 
([45], see also [19, Ch. 12] and [41]). A collection A r = (A]) 

(ij)eE T °f probability measures 

on is called an entrance law (or boundary law) for the specification ^ = {ip.,ip) on T if it 
satisfies the consistency requirement ([45, (3.4)]) 

k(zdi\j cr k 

where ^iji^h a j) = '4'(o') 1 ^ Di 'ip(a,a')%jj( y a') 1 ^ Dj , the pairwise interaction potential correspond- 
ing to if)_. It is shown in [45, Thm. 3.2] that there is a one-to-one correspondence between 
Markov chains v and entrance laws A^, given by 

(ij)eEu i&dU o-j 

for [/ any finite connected sub-graph of T, with p(i) denoting the unique neighbor of i inside 
U for z G In particular, we see that the Gibbs measure Vt arising from h G H*(^,B) 
is precisely the Markov chain with entrance law A* (a) = hj^i{a)%f){a)^ 1 ^ D: > . Extremal ele- 
ments of are Markov chains [45, Thm. 2.1] but the converse is false; for example, the 
free-boundary Ising Gibbs measure is non-extremal at low temperature (see [13, 25]). The 
measures ii <g> v arising from elements of B) might naturally be termed "unimodular 

Markov chains" or "Bethe Gibbs measures," in the sense that the entrance laws for the 
entire collection (^t)tsT. are specified by a single measurable function h : T e — > which 
is a Bethe fixed point. In the case /i = 5j d these correspond precisely to the completely 
homogeneous Markov chains studied in [45, §4]. 

Proof of Thm. 1.14- Suppose uniqueness holds at (/3, B), i.e. Wt = {^t} /i-a.s. Then H*(/3, B) 
has size at most one by Rmk. 2.3. For /i-a.e. T, the measure vt is extremal, and so spec- 
ifies a Markov chain on T with entrance law At (see Rmk. 2.6). If we define h x _^ y (a) = 
h {T ^ y) (a) S Xl(a)4)(o-) l l D % then h G H*(fi,B), which proves that "H^(/3, B) is a singleton. 

Now consider interpolation in (3 or B. All the conditions of Thm. 1.13 are satisfied by 
assumption except (1.18) and (1.19). If uniqueness holds at ((3,B), it follows from the 
preceding discussion that there is a unique // <8> v G corresponding to the specification 
(ip 13 ,ijj B ). Any local weak limit of (G n ,a n ) must be such a measure, so (G n ,a n ) -^i wc n®v; 
likewise, any element of H*(/3, B) gives rise to ii £g> v. Therefore, 

lim a e n (f3,B) = V J V^K^)1 = a e (/3, B, h), 

n— >oo Z L ' 

jedo 

where the limit in expectation is justified by the boundedness of on compacts and uniform 
sparsity (as in the proof of Lem. 2.1). This verifies (1.18), and the verification of (1.19) is 
entirely similar. The result therefore follows from Thm. 1.13. □ 

Remark 2.7. If uniqueness of Gibbs measures does not hold, one may consider extremal 
decomposition of the subsequential local weak limits fx of (G n ,a n ), either in the spaces 5fy 
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(possibly losing unimodularity in the decomposition) or in the space % . Extremal decom- 
position in % is discussed in [1, §4] but it is unclear whether extremal elements would be 
unimodular Markov chains in the sense described here. A decomposition of // = /i <S> v 
into unimodular Markov chains \i%v' would obviously yield a substantial generalization of 
Thm. 1.14. 

2.4. Application to independent set. We now prove Thm. 1.11, our result for the in- 
dependent set model (1.4), by verifying the conditions of Thm. 1.14 for the interpolation 
parameter B = log A. In this setting a convenient parametrization for the messages h G % 
is u = h(0), so that the BP mapping (1.14) becomes 

(BP x u) x ^ y = - — - 1 . (2.9) 



\v 



u, 



A single BP iteration is anti-monotone in the messages u v ^ x , so a double iteration is mono- 
tone. Since the root marginal for an independent set model in T 2t ~ l is obtained by an even 
number of BP iterations starting from level 2t (see Rmk. 1.12), it is monotone in the bound- 
ary conditions. Recalling from §1.2.3 the definition of = fry for $ G {0, 1} and writing 
Urf = /^(O), the above implies that for 1 < s < t, 

-2s-l,0 ^ -2t-1.0 ^ -2t-l,l ^ -2s-l,l ^ -1,1 1 

Thus the t — > oo limits h^,h]^ are well-defined with h^(l) > h^(l) > 1/(1 + A), and using 
these we define messages e "H, h\_^ y = Jv T . The next lemma gives the boundary values 
for the interpolation. 

Lemma 2.8. For the independent set model on G n — »z wc H, 

limlimsup|0 n (A)| = = lim $(A, h x ), \ G {0,1}. 

Proof. The left limit follows from the trivial bounds 1 < Z n < (1 + A) n . Next, for any h EH, 

$5?(A,/i)=log{l + Aj] 

j&do 

so $™(A, h) — > both /i-a.s. and in /i-expectation as A I 0, by bounded convergence. The 
same holds for $f,(A, /i*), J G {0, 1}, using the bound /i*_>»( ) ^ 1 /( 1 + A )- D 

Proof of Thm. 1.11. The independent set model (1.4) is of form (1.1) with S£ = {0,1}, 
ip(a,a') = l{aa' ^ 1}, and $(a) = X a = e Ba , so (HI) is clearly satisfied with a p = the 
permitted state. By definition of A c , if A < A c then h° = h 1 = h in "H, and it then follows 
from the recursive structure of the tree that h G K*(A). Since /4-m/O) > 1/(1 + A ) as noted 
above, (H2 B ) is satisfied on any compact interval of A. 

For T 6 7i, as noted above the root occupation probability on T s for s > 2t — 1 with 
any boundary conditions is sandwiched between ' (1) and /iy -1 ' (1), with the former 
increasing to ^y(l) and the latter decreasing to ^(1). Since the are clearly continuous 
in A, it follows that h^(l) and h\(l) are respectively lower and upper semi-continuous in A, 
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so if they coincide then their common value ^r(l) is continuous in A. Applying this with 
T = T x _^ gives the /i^-a.s. continuity of h x __> y on (0, A c ). 

For T G T., hj, for f G {0, 1} is a function of (hjZ^ )jedoi so f° r A < A c we have that 
hj< = h\, /x-a.s. It then follows from the preceding observations and Rmk. 1.12 that the 
boundary effect vanishes and \&t\ — 1 /i-a.s. Thus, we are in the setting of Thm. 1.14 (b), 
and it remains only to complete the verification of (H3 B ), i.e. the boundedness in total 
variation of the messages h x ^ y : 

(a) No verification is needed since boundedness in total variation is simply assumed. 

(b) For T eT.,v$ = J#{0) satisfies 

i og 4<+« = _ io g A + a n 1 s2 ,-o. ) • 

Differentiating with respect to A, we find that = (1 + X)d\ logWy satisfies 

| 4 « + M | < 1 + _A_ flo+ (_A_) 2 E E |rr u,. (2 . 10) 

j'eOo k£dj\o 

Since w^ 1 = 1/(1 + A) for any T G 71, we find that 

- £>1 

If Ai/(1 + Ai) < 1/brT then this is finite and uniformly bounded on [Ao, Ai] (see [30, 
§2]), and consequently = lim^oo u^* -1 ' 1 has deterministically bounded total variation on 
[Ao,Ai]. If Ai < A C)/i then h^y' 1 — > h x _+ y on [A ,Ax], so if brT x ^ y < A — 1 /x^-a.s. and 
Ai/(1 + Ai) < 1/(A — 1) (i.e. Ai(A — 2) < 1), then h has deterministically bounded total 
variation on [Ao,Ai]. 

(c) Since the limiting measure is supported on T d , only h = h(j d>x -^v) is of relevance, and 
(2.9) reduces to BP A w = (1 + Aw d-1 ) -1 . For A < A c = A c (ci) there is a unique fixed point (see 
[27, §2]), which is then easily seen to be monotone in A. 

Thus (H3 B ) is verified in parts (a)-(c). Also, 0(A C ) = lim A1 - Ac 0(A) as an immediate 
consequence of Lem. 2.1. The rest of the theorem follows by applying Thm. 1.14 and then 
taking B = log A — > — oo, relying on the boundary value given by Lem. 2.8. □ 

3. Bethe prediction as optimization over local polytope 

Throughout this section we assume that z£ = (if), ip) satisfying (HI) specifies a factor model 
(1.1), and that \i is a unimodular measure on T. with E M [.D ] < oo. We study the Bethe 
prediction as the optimization of the Bethe free energy functional $ M on H\ oc as defined by 
(1.22). We first verify that this agrees with the previous definition (1.9) of $ M on 7-L*((3,B), 
which we always regard as being embedded into T-L\ oc via (1.21). Recall from Defn. 1.15 that 
for h G %\oa the one-point marginals of h xy are denoted h x and h y , and are measurable 
functions T. — > A%-. 
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Lemma 3.1. The functional $^ on l-L\ oc given by (1.24) agrees with the previous definition 
(1.9) on "H*(P , B) , subject to finiteness o/E M [$y]. 

Proof. If h corresponds to h G £?), then (1.21) and (1.15) imply that 

h xy (a, a') exp{^ y \h)} = ifj(a, a')h x ^ y (a)h y ^ x (a'), 

h (a)exp{^(h)}=^(a) J] V^, oy^i-ofo)) ■ 

Letting 3>w(/i) (1 < i < 3) denote the three terms on the right-hand side of (1.23), it follows 
from the above that 

= E M [$3?(/i)] - E M [ ^ £ h (a) log ( ^ ^((7, a,)!'., .Jo,)) 

j&do ct CTj 

& 2 \h) = E^ T (h)} - ^E, [ £ E M*, kg( Wa)Z* 

jS9o CT,CTj 

= E M [$|,(/i)] -eJ^T^T lo § W") 



>i)) 



jG9o ct 

$^(h) =1^^ log ( £ ^ W*) V>°fa 

j&do a <Tj 



2E M [^(/i)], 



where unimodularity was used in the simplification of &( 2 \ Adding these three identities 
gives = E M [$^ x (/i) - as claimed. □ 

As mentioned in §1.3.2, our definition $ M of the Bethe free energy functional on T-L\ oc is an 
infinite-tree analogue of the definition of [44] for finite graphs. It is proved in [44, Propn. 6] 
that when ip > 0, all local maxima of the Bethe free energy lie in the interior of the local 
polytope. We now prove an analogous result for infinite unimodular trees, assuming only 
permissivity of 

Proposition 3.2. For permissive if h is a local maximizer of $ M over T-L\ oc then h e 

Proof. Assume without loss that h e %i O c['0]> since otherwise clearly $ M (/i) = — oo. If 
u G Hiocty], then it follows by convexity of TLi oc that h v = h + r](u — h) = h + i]8 belongs 
to T-i\ oc [ip] for any 77 e (0, 1]. Letting 

BJ>(6) = - W(S) ' lt " {6) 



V 



log 7 ? I 



our claim will follow upon showing that if h ^ %° oc ['0]> then there exists such -u for which 

lim^(tf) = Pl°(6) > 0. 

774,0 

To this end, note that by an easy computation [H(h r, ) — H(h)]/r] = — (log h v ) s — (f ri (5/h)) h , 
where f v (r) = rf l log(l+r/r) and (8/h)(a, a') is defined to be 8(a, a')/h(a, a') if h(a, a') > 0, 
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zero otherwise; note that u = h + S > implies S/h > — 1. Thus from (1.22) we obtain 
BP (8) = Bl(S) + R2(6) where 

BUS) = E„ [2(0 5o + 2( J D - l)(P(8 /h ))- ho + (<0«« ~ (P(8 OJ /h OJ )) h , 

j&do 

BUS) = E,[2(D - l)(\ogK) So ~ X>g^>«„] (3-1) 

Since for r > — 1 and r\ G (0, 1) we have 7]~ l log(l —77) < f v (r) < r, it follows from dominated 
convergence (and the boundedness of £ on supp<5) that R-x(S) converges to a finite limit as 
7] 4- 0, and so converges to zero upon rescaling by | log 77 1 . Again by dominated convergence, 
R%(8)/\ log r/| converges as r\ \, to 

= En [(2 - 2 j D )m ({ ( t : h {a) = 0}) + ^ u oi ({<7, a' : h OJ (a, a') = 0})] . (3.2) 

.j'gdo 

Let A Q = A(t, ) = {o" G : ft, (c) = 0}. Since h xy (a,cr') = whenever either cr G A x or 
a' £ A y , we have by unimodularity of fi that 

£°(<5) > E M [(2 - 2£> )u (A ) + {uo{A ) + %(Aj) - u oj {A x A,)} 

je<9o 

= E M 2u (A ) - ^ u oi (A xAj) =E^Y^ ^ 

where B ^j = 1{D Q > 0}[L>~ 1 u (A ) + DT^Aj) - u oj (A x Ay)] (by (1.20), necessarily 
A Q = when L> = 0). 

Noting that A C Q 7^ 0, consider the measurable function u : — > A%~ defined (up to 
/^-equivalence) by 



U {<7) = U(T, )(.Cr 



l{a = <x p }, 



, else. 



(3.3) 



Among those u G Hi oc with support contained in {(a, a') : cr p G {a, er'}}, there is a unique 
one with marginals (3.3). On the event {D > 0}, we have the following: 
- If cr p G A Q n Aj then 



u^,a / ) = i(l{(T = a*} 



|^\{aP}| 



= a p } 



l{a G ^\{qP}} 
l^\{^ p }| 



,-1 



so ^ = (2DO)- 1 + (2Dj 

- If a p G A n Aj then w (A ) > 1/2 while %(Aj) = = w oi (A x A,-), so iZ^j > 
(2D o y l . Symmetrically if a p G A C Q n Aj then _R ^- > (2-Dj) -1 . 

- If cr p £ A U Aj then B ^j = 0. 
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Thus R°(d) > 0, with strict inequality unless cr p ^ A Q U Aj /i^-a.s., in which case we take 
A a , Aj in place of A C Q , A] in (3.3). Then 

R ^ = {2D )- 1 1{A ± 0} + {2D j )~ x l{A j ^ 0}, 

so i?°(<5) > unless p,(A = 0) = 1. But in this case taking u G Hioc identically equal to 
the uniform measure on supp ip gives 

R°(S) = - ?— - T lJ V |(suppV)\(supph c 

|supp^| 

If h ^ "Hioc^] then this is positive, completing the proof of our claim. □ 

Our main result in this section is the following infinite-tree analogue of [44, Thm. 2], 
characterizing the interior stationary points of $^ as fixed points of the Bethe recursion. 

Proposition 3.3. For ^> permissive, any stationary point of inside %° oc [^\ belongs to 
U*. 

Proof. Let H^ C [V] denote the space of measurable functions 8 : % — >• K'* 2 (defined up to 
//-equivalence) such that supp 5 xy C supp^, S xy (a,a') = d yx (a',a), the one-point marginals 
4(a) = ^6 xy (cr, a') do not depend on the choice of y G dx, and 8{a ) — Ylaa' < K cr > a ') = 
0. 

S'tep 1. We first show that if h G T-L° oc [^] is a stationary point of $ M , then there exists 
A : 7^ — > measurable such that 

h xy {a, a') = ip(a, a') exp{X x ^ y (a) + X y ^ x (o-')}, //-a.s. (3.4) 

Since h G ^^[^1; if <5 G "Hj^J^] with |<5| < /i^-a.s., then h v = h + r/6 belongs to "Hioc^] 
for all \rj\ < 1. Taking 77 — >■ in (3.1) gives (by stationarity of $ M at /i) 

= R (d)=E,[2(K' o )- So + J2(K'h 

j&do 

where «£. = f + (L> x . - 1) log/^, < y = (£ - log h xy )l suppi ,. 

Consider now 6 with one-point marginals 8 = 0, so that the value of ft' becomes irrelevant: 
in this case the value of R°(S) is unchanged upon replacing k' by 

K xy (a,a') = l anp p^(a,a')[K xy (a,a') + X x ^ y (a) + A^ x (cr / )]. 

We claim it is possible to choose A such that k has one-point marginals ft = 0, /v-a.s. This 
amounts to solving the linear system 



„ / -10 II \ \ I - Q A ) ^ 



where, writing r(cr) = |{ct' : i/;(a,a') > 0}|, 



a x ^(cr) = , Q(a,cr) = 



r(cr) r{a) 

For permissive, the Markov kernel Q is irreducible and aperiodic, with stationary distri- 
bution r = (r(cr)) CT (by symmetry of ip). By the Perron-Frobenius theorem, Q, Q 2 both have 
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unique left eigenvector r corresponding to eigenvalue 1. Therefore dimker(J — Q 2 ) = 1, from 
which it is easy to see that ker Q* = (im Q)- 1 is the linear span of (r, — r). Since the assumed 
symmetry properties of ip and h imply that 

((r, -r), (a x -> y , a y ^ x )) = ^(-K xy (a, o') + K' yx (a, a')) = 0, /^-a.s., 

there is a unique solution (X x ^ yj X y ^ x ) to the system (3.5) giving the required solution to 
(3.4). 

For this choice of k, 8 — ck belongs to 'H^ )C [ip] for any measurable c : T e — > M>o with 
c xy = c yx . We can choose c small enough so that \8\ < \h\ on supp-^ /i^-a.s. With this 
choice, = R°(8) becomes the //-expectation of a (weighted) sum of squares, so k = 0, and 
rearranging gives (3.4). 

Step 2. Returning now to general 8 G T^JV'] with \S\ < h jj^-a.s., we obtain from (3.4) the 
simplification 

o = r°(s) = e m [2(k' ) So - e (<w*. + (V*)*,)] = 2 ^[(< - E x ^)- s ] > ( 3 - 6 ) 

j&do j&do 

using unimodularity of fi for the last identity. We claim that 

y£dx a' yddx 

Indeed, for any 5' : T. — > M measurable with Yl a S' (cr) = /i-a.s., 

S' xy (a, a') ee ~6' x (a)l{a' = a*} + 5' y (a')l{a = a*} 

defines an element of "H^JV^- By considering (3.6) with 8 = c8' where c xy = c yx is small 
enough so that |c<5'| < \h\, we obtain the claim (3.7). 

Step 3. Rearranging (3.7) we find that h satisfies /i^-a.s. 

h oj (a, a') = ip(a, a') exp{X ^j(a) + Xj^ (a')}, (3.8) 

y^^lE^U^I (39) 

If we then re-parametrize 

Xo^j = £ + E logm fe _^ /J-a.s. (3.10) 

k£do\j 

(well-defined, for each T and a G by invertibility of the D D -dimensional matrix 11* — /), 
then the formula (3.9) for h a becomes 

h (a) = i)(a) Y\ m k ^ (cr), /J-&.S. 

k&do 



m as 
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On the other hand, h a is the first marginal of h Q j, and setting the above equal to the sum of 
(3.8) over a' gives (making use of (3.10)) 

fhj^oia) ^ (y'y X ^ o{u '\ //-a.s. 

Thus, if we define m : T e — > A$r, m x _> y (<j) = e Xx ^ y ( a \ then (3.10) can be written in terms of 

k£do\j o"fe 

- that is, m £ H*. Then, (3.8) is precisely the statement that m maps to h via (1.21), 
which concludes the proof. □ 

Proof of Thru. 1.16. By (HI) the set Hf" c of h £ U\ oc for which $(P,B,h) > -oo is non- 
empty and does not depend on (/3, B), so without loss we will restrict to h £ "H^- 

Again by (HI), the functions (/?, B) \-t B, h) indexed by h £ Hf^. are uniformly 

equicontinuous on compact regions of (/3,B): for any e > there exists 5 > sufficiently 
small so that if (/3, B) and (/?', 5') are within distance S then |$^(/3, 5, to) -$ At (/3 / , 5', h)\ < e 
for all h £ 7^f n c . Let h £ "Hf n c such that $ M (/3, B, fc) > $p{p,B) - e. Then, 

5 ') > <W> B 'i h) > MP, B) - 2e 
for all (P', B') within distance 5 of (P, B). Reversing the roles of (P, B) and (P', B') completes 
the proof of part (a). The statement of part (b) is a summary of the results of Lem. 3.1, 
Propn. 3.2, and Propn. 3.3. □ 

We supplement Propn. 3.3 by computing the second derivatives <9^$ M (fa. + r]8) at interior 
stationary points h, giving a criterion to verify that such points are local maximizers. 

Proposition 3.4. For permissive let h £ H° oc [ip] be a stationary point of and let 
5 £ K^JV] with \S\ < \h\. Then h is a local maximizer of $ on the one- dimensional space 
"Hioc n {h + r/S : rj £ M} if and only if 

4dfo,(h + v d)\ v=0 = E,[2(D - l)((5 /h )\ o - J2(( 6 o 3 /h OJ ) 2 ) hoj ] < 0, (3.11) 

j&do 

or equivalently 

®MW 2 )- ho ] > ^ [ £ {((W 2 h + ((h/h h f)- hh - ((<VMX)] • ( 3 - 12 ) 

jedo 

It is a strict local maximizer if (3.11), (3.12) hold with strict inequality. 

Proof. For h £ H° oc [ip] and S £ "Hj^jV'] with \S\ < \h\, arguing as in the proof of Propn. 3.3 
gives 

2d^ fl (h + V 8)\ v=0 = UmR\8) = R°(S) 



7?->0 

E„[2<0j o + 2(D Q - l)(logh )- So + J2 ((tK ~ ( lo S^) 5o 
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If h is further a stationary point of then, for 77 < 1, 
T"(_) = -W(8) = -\R V (5) - R°(5)] 

T] 7/ 

= 2E,[2(D - l)(f>(5 /h ))- So - Ys(f V ( S o 3 /h OJ ))s, 

jedo 

+ 2(D - l)(g%S /h ))- ho - (<W^i)>V 

j&do 

where g v (r) = [f v (r) — r]/rj, with lim^o 9 v { r ) — —r 2 /2. Since \S/h\ < 1, it follows by 
dominated convergence that 

Adft,(h + V 8)\ v=0 = limT"^) = T°(<5) = E,[2(D a - l)((5 /h )\ o - ^(((5 OJ /h OJ ) 2 ) h , 

j£do 

The stationary point h is a local maximizer on T-L\ oc PI {h. + t]8 : r/ G K} if and only if 
(9^$^(h.+?75)| r)= o < 0, which gives (3.11). The condition (3.12) is equivalent by an application 
of unimodularity. □ 

4. Application to Ising and Potts models 

In this section we apply Thm. 1.13 to prove our results for the ferromagnetic Ising and 
Potts models, Thms. 1.8-1.10. Although both models have regimes of multiple fixed points, 
monotonicity arguments allow us to restrict the space of fixed points. In the Ising model we 
can restrict to a unique fixed point and give a complete verification of the Bethe free energy 
prediction; in the Potts model with q > 2 there remain regimes of non-uniqueness where we 
can only provide bounds. 

4.1. Ising model. We first prove Thm. 1.8. Recall the definition (1.12) for the Ising measure 
Uq B for a finite graph G = (V,E), and more generally (from Defn. 1.7) the Ising measures 
v\f(? and Vu'q B for a finite sub-graph U of a (possibly infinite) graph G with free and + 
boundary conditions. We will make use of the following direct consequence of the classical 
Griffiths's inequality (see e.g. [29, Thm. IV. 1.21]). 

Lemma 4.1. For the Ising model with parameters (3,B > on U a finite sub-graph of a 
graph G with boundary conditions f G {f, +}, the magnetization (o~ v )j}^f at vertex v G U is 
non-negative, non- decreasing in (3, B, non- decreasing in U for \ = f , and non-increasing in 
U for% = +. 

Recall from §1.2.1 the definitions of Ji^ for f G {f, +}; the measure is parametrized 
by the corresponding magnetization my = Jij, (+) — (— ). By Lem. 4.1, m^ f is non- 
decreasing in t while fnj^ is non-increasing, so there exist well-defined limits m^(/3,S) = 
Hindoo m^(/3, B). The following result from [11], an extension of [9, Lem. 4.3], shows that 
these limits agree on any T G 71. 
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Lemma 4.2 ([11, Lem. 3.1]). For the Ising model (1.12) on an infinite tree T with /3, B > 0, 
there exists a constant C = C(/3,B) such that 

m% + - fftf < C/t Vt > 1. 

By this result we can define h G H by h x ^ v = hL = Jit , and we now proceed to 
verify the Bethe prediction <f>((3, B) = B, h). 

Proof of Thru. 1.8. The Ising model (1.12) is of form (1.1) with JT = {±1}, £(o~,cr') = 0o~o~', 
and £(cr) = Ba, so (HI) and (H2) are clearly satisfied (with no additional moment conditions 
on D a , since if) > 0). It follows directly from the recursive structure of the tree that h G %*. 
It will be shown in Lem. 4.5 that for /3 > fixed, 

lim limsup | </>„(/?, B) - $„(/3, B,h)\ = 0, 

so to prove the theorem we will interpolate from (/3, B) to (/3, £?i), then take B\ — > oo. 

It follows from Lem. 4.1 and Lem. 4.2 that for T G 7^, fn f T ((3 : B) = mJ(/3, -B) = ?tit(/3, -B) 
is the increasing limit of (/?, 5) and the decreasing limit of m^ + (/3, .B). The #'(5, -B) are 
continuous and non-decreasing in /3, 5, so m inherits these properties by the same argument 
as in the proof of Thm. 1.11, and so (since it takes values in [—1,1]) is of uniformly bounded 
total variation. This verifies both (H3^) and (H3 B ) (though we will use only the latter). 

We conclude by showing (cf. (1.19)), that 

lim E^dCK))^] = a vx (/3, B) = E M P B £(a )]^ B ]- 

Here <9b£(o") = a, and it follows from Lem. 4.1, our assumption of G n —ti wc n and Fatou's 
lemma that 

^A¥ofT^ B \ < Hminf E„[<<7 >^£] < liminf E n [(a In f n ' B ] 

t—>-oo ' n— too 

< limsup E n [(a In ) p n ' B ] < lim sup E^)^/] < E^jf^]. 

n— >oo t—too 

The left-most and right-most expressions coincide by Lem. 4.2 so equality holds throughout. 

By Thm. 1.13 (b), B) = $(P,B,h + ) = $(/3, B, h { ) for (3 > 0, B > 0. Since n is 
symmetric in B and continuous at B = (uniformly in n), we have (f>(f3, B) = <f>((3, —B) and 
</>(/?, 0) =lim^ o 0(/3, J B). □ 

4.2. Potts model. We now apply Thm. 1.13 to deduce our result Thm. 1.9 for the Potts 
model (1.3) with /3, B > 0. From now on we let E£ = [q] with q >2. It will be convenient 
to generalize (1.3) to the inhomogeneous Potts model 

v^ia) = exp { ^ AAw + " G ^ 

We now introduce the coupling of the Potts model with a random-cluster model which 
we use to obtain monotonicity properties. The following representation is as in [22] (see 
also [6]). If G = (V,E) is a finite graph, let G* be the graph formed by adding an edge 
from every v G V to a "ghost vertex" v*, that is, G* = (V*,E*) where V* = V U {v*} and 
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E* = E U {(v , v*) : v G V}. Writing a for elements of and 2 for elements of {0, 1} E * 
(bond configurations), consider the probability measure on pairs (a, rf) defined by 

w^(a,n) = J] ~ II - !)<W ( 41 ) 

8 B 

The marginal on a v is the inhomogeneous Potts measure Vq~ , while the marginal on n is 

the (inhomogeneous) random- cluster measure 

4*(n) = n p"^ 1 - p^~ rh n ^ ( 4 - 2 ) 

ee£* Cer; 

where = 1 — e _/3iJ for G -E and pi V * = 1 — e~ Bi for z G V, and the last product 

is taken over connected components C of 2, with 0(C) = g unless t>* G C in which case 
0(C) = 1. Given a configuration 2, a realization of the conditional law TUQ B (a = -\ri) is 
obtained by choosing a constant spin on each connected component C of n independently 
and uniformly over [q], except for C containing v* which is given spin 1. 

For a detailed account the random-cluster model see [21]; we will use only the following 
basic properties: 

8 B 

Proposition 4.3. The random- cluster measure Hq is FKG. It is also increasing, in the 
sense of stochastic domination, in (@_,B). 

Proof. The FKG property follows by a straightforward modification of the proof of [6, 
Thm. IILl(i)]. Monotonicity in (@_,B) follows by modifying the proof of [21, Thm. 3.21]. □ 

Recalling Defn. 1.7, for U a finite sub-graph of a graph G and f G {f}U [q] (with f = free), 

i 8 B 

let v% q denote the Potts model on U with f boundary conditions. 

Corollary 4.4. For the Potts model with parameters (3,B > on U a finite sub-graph of a 
graph G with boundary conditions f G {f, 1}, and for any vertices v,w G U , the quantities 

v u,G \?v = !)> u,G K = a ») 
are non- decreasing in (3 and B , non-increasing in U for $ = 1, and non- decreasing in U for 
t = f- 

Proof. Note that v\f G B is the marginal on g_ v of the measure tUq— with 

Bi = B Vi G V, fi e = pi{e G E v }. 

Similarly, v\f^ is the marginal on q_ v of the measure Wq '— with 

f3' e = Ve G E, B\ = Bl{i G Vu} + ool{i £ V v }. 

Clearly, (/3, B_) is non-decreasing in U while (/?', S') is non-increasing, and both are non- 
decreasing in (3, B. The result therefore follows from Propn. 4.3 by showing that for any 
(d,B_), the conditional probabilities vDq~ (o~ v = and WQ—{a v = a w \rj) are monotone 
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functions of ij. Indeed, letting w = Wq and writing v yj to indicate that v, w belong to 
the same connected component of n, we have 

1 — 1-Pu <~**^> v*\ 

zu(o v = = l{v v } H -, 

1 — 1-ff w\ 

w(a v = crJrj) = l{v w} H . 

These are increasing functions of 2 so the proof is complete. □ 

Under the measures with \ 6 {f, 1}, any one-vertex marginal must be uniform on the 
spins 7^ 1, and so is characterized by the probability given to spin 1. In particular, recall 
from §1.2.2 the definitions of for f G {f, 1}; existence of the t — >■ oo limits Jv T is now 
justified by Cor. 4.4, so we can define G % by h\_^ y = h\^. The following lemma gives 
the boundary values for the interpolation in (/?, 5) using fv: 

Lemma 4.5. For the Potts model on G n — >i wc \i, let 

B) = B + 0E»[D o }/2 + E M [^(|T|)], <p(n) = (p B {n) = n' 1 log(l + (q - l)e- Bn ). 

(a) For all B G R and any heU, 0(0,5) = log(e B + q - 1) = 9^(0, B,h). 

(b) For (3>0 and h G W, 

lim limsup|0 n (/3, J B)-$ M (/3, J B)| = O= lim S, /i) - B)\ 

B— >oo n >oc B—>co 

(c) For B > 0, lim^oolimsup^ \(f> n (j3,B) - = 0. 

(d) ForB>0 and f G {f, 1}, lim^ 5, ft*) - B)\ = 0. 

Proof, (a) At /3 = 0, ^ = 1 so the spins are independent. Thus, for all n > 1, h G "H, and 
T G 71, 

n (O, B) = log(e B + q - 1) = <£^ x (0, B, h) = $ T (0, 5, /i), 
since <&j^ = for all j G <9o. 

(b) The value of Z n (/3, B) is bounded below by considering only the ground state a = 1, and 
bounded above by decomposing <5T y according to the subset of k vertices where the spin is 
not 1. For > this gives 

1 < Z n (P, B)e- Bn ~^ < (?) (<? - i)^" 5 " = (! + (?- l)e~T, 

so if we define (f> n (/3,B) = (fi n (/3,B)-B-/3E n [\E n \]/n, then lim^oo lim sup n ^. oc \<f) n (f3,B)\ = 
0. Recalling (2.2), this proves the left identity in (b). 
We next define 

$™ = $™ - B - PD , & T = $° T - f3D /2, $ T = $™-$ C T , $^ = E M $ T , 

so that to prove the right identity in (b) it suffices to show lims^oo $ M (/3, B, h) — for any 
ft, G Indeed, (1.15) gives that /i-a.s., limg^oo hP^Acr) = 5 aj i for all j G do, hence also 
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liniB-s>oo hj!^ (a) = 5 a ^ for all j G do by equivalence of and /A Thus 

lim ^ x (/3, S, A) = = lim 5, h), //-a.s. 

It is easily verified that 

- 0A, < 3W, 70 < log 9, -A/2 < §1(0, B, h) < 0, (4.3) 
so fj,(/3 , B , h) — > by dominated convergence. 

(c) Suppose first that G n is connected. Then Z n (j3, B) is bounded below by considering only 
the q constant-spin configurations, and bounded above by decomposing 2J V according to the 
subset of t edges across which the spins disagree. Since G n is connected, removing £ edges 
leaves at most £ + 1 connected components, of sizes ko, . . . , kt summing to n. Therefore, with 
(p{n) = (f B (n) = mp B (n), we have 

e*»> < Z n (P,B)e~ Bn -^ < £ (^ nl )e^ max { exp { £>(*,)}}, 

where the maximum is taken over ko, . . . , ke G Z>o summing to n. By convexity of ip this 
maximum is achieved with k r = n for some r, so 

zip i\ 

f(n) <n$ n (j3,B) < y?(n) + E n [log { ^ I 1 n 'je^VjJ = + E n [|£ n |] log(l + ge"^). 

(4.4) 

If G n has connected components C- 7 = (V^E^), j > 1, with \V^\ = n\ then clearly 
Z n (p,B) = U j Z cj (p,B),so 

O<0 n (/3,5)-^E n [^(^')] <^n{\E n \]log(l + qe-P). (4.5) 

j 

With j(z) denoting the index of the connected component of G n containing vertex i, we have 
n~ 1 E n \^2 j cp{n j )) = E n [<p(n j ^)). Then, since (p'{n) < 0, 

E n [^(|B t (I n )|)l{£ t (/ n ) = < E^(n^)] < E n [?{\B t (I n )\)\. 

Since G n -^i wc letting n — > oo followed by £ — > oo in the above inequalities gives 
E n [Lp{n^ In ^)] E M [<y?(|T|)], and so (c) follows from (4.5) by taking first n — > oo and then 
/3 ->• oo. 

(d) Clearly fay = fty for any finite T e T. (as = for large enough £). In the /3 — > oo 
limit only the constant-spin configurations contribute, so 

lim h^ B (a) = e -fi\T\)-B\T\{x-s^)^ t e {f; 1} (4 6) 

For T infinite, recall from Cor. 4.4 that h f Tt (l) < ^(1) < ^t(1)j so if -B > then 
1 = lim lim tf'W'BQ) < lim h { '^ B (l) < lim h}^ B {l), 
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so that (4.6) again holds for T infinite. We then compute 
lim B, h*) = -^ v(|Zi_J) + <p(\T\)> 



/3— >oo 



hm <J e r (/3, S, A*) = -1 J2 v(\ T i-*o\) - \ £ ^(l T o^D + T^ (|T|) ' 

/x-a.s., where the first identity uses |T| = 1 + ^jea l^}->ol anc ^ the secon d uses |T| = 
l^o-^'l + l^}->ol- Convergence also holds in //-expectation, using the upper bounds in (4.3) 
together with 

j&do j&do j&do 

and the fact that h^y(l) > l/q for (3, B > (by Cor. 4.4). Thus, using unimodularity of 
/i, we have 

lim $MB,h*) =E A4 [(1-D /2)^(|T|)], 

and we conclude by showing that this coincides with E M [<^(|T|)]. The case \T\ = oo is trivial; 
otherwise, another application of unimodularity gives 

1 -^[dm\t\)\ = ^e m [a, J>(|t|)] = ±e,[J2dm\t\) 

x£T x£T 

= EM\T\)\Et\] = EM\T\)] - ®M\T\)]- 
Therefore, lim^oo $ At (/3, B, h) = E M [<^(|T|)] which concludes the proof. □ 

Proof of Thm. 1.9. The Potts model (1.3) is of form (1.1) with ^ = [q], ^(a,a') = (35 a y, 
and £(cr) = B5 Ut i, so (HI) and (H2) are clearly satisfied. It follows from the recursive 
structure of the tree that h} e W for $ e {f, 1}. For part (a), along any interpolation path 
contained in TZ^, both (IK' 3 ) and (H3 B ) are satisfied by Cor. 4.4 and the same argument 
used in the proof of Thm. 1.11. For part (b), (IK' 3 ) and (H3 B ) are satisfied by the additional 
hypothesis of continuity. 

The inequalities in part (b) then follow from Theorem 1.13 once we verify (cf. (1.18), 
(1.19)) 

a vx (/3, B, h f ) < liminf < x (/3,£) < limsup< x (/3,5) < a vx (/3, B, h 1 ), 
a e (/3,5,/i f ) < liminf a e n (/3,B) < limsup<(/3, B) < a e (/3, B, h 1 ), 

n— >oo n— >oo 

where a™(/3,B) = E^^xff) and <(/?,£) = §E n [ ^di n <<W; )i '* ] ■ Indeed ' b ^ 
Vitali's convergence theorem, the assumption G n -^i wc ^ and Cor. 4.4 (with U = B t (I n ) C 
G n ), we have 



a c 



((3,B,h f ) = liminf ^Ej£(^)^l < liminf <(/3, 5), 



jedo 
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and the other inequalities are proved similarly. Together these inequalities imply that 
lim [0 n (/3', B') - n (/3, B)} = $(/?', B', h?) - $(/?, B, h?) 

n— >oo 

for any (/3,B) and (f3',B') joined by an interpolation path contained in IZ^. The result of 
part (a) then follows by letting (/?', 5') approach and applying Lem. 4.5. □ 

4.3. Potts model with d-regular limiting tree. In this section we prove Thm. 1.10, 
which amounts to determining the shape of TZ-l and establishing continuity of h { and h 1 in 
certain regimes. 

Since the limiting measure is supported on T d , only h = h(j d ,x-yy) is of relevance. Further, 
h* is symmetric among the spins ^ 1 for f e {f, 1}, so determination of h} reduces to solving 
a univariate recursion for h*(l), 

h ^ e B [e^h+(l-h)] d - 1 

e B [ePh + (1 - h)]*- 1 + {q-l)[h+ ^{e? + q- 2)]^ ' 

Our result follows from analysis of the fixed points of this mapping (similar computations 
have appeared e.g. in [41, 45] so some overlap among the analyses may occur). 

A convenient parametrization is given by the log likelihood ratio r = log/i — log[(l — 
h)/(q — 1)], in terms of which the recursion becomes 



r ^ f(r) = f(r; (3, B) = B + (d - 1) log (- 



-e r + el 3 + q - 2, 

With /w the t-fold iteration of /, let r f denote the increasing limit of /^(0) and r 1 the 
decreasing limit of /w(oo), as i -> oo. The region corresponds to those /3, B > for 
which r f 7^ r 1 . 

Lemma 4.6. There exists /3_ > snc/i i/iai /or /3 < /?_ £/ie map / /ias exactly one fixed 
point for any Bel. For /3 > /5_ i/iere exist real-valued B_((3) < B + ((3) (smooth in (3) such 
that f has one, two, or three fixed points depending on whether B is in [B_, B + ] c , {B_, B + }, 
or (B_,B + ). The curves extend continuously to B_((3_) = B + (f3_). 

Proof. We have 

f(r) = (^-IK(^-I) + f4 7) 

J V } (q + e r + e? - 2) (q + e r +? - 1) 1 ' ' 

so / is increasing in r with f'{r) — > as r — > ±oo. Since f(r;(3,B) = f((3;r,B), it easily 
follows from (4.7) that dpf(r) has the same sign as r while dp[f'(r)] > 0. Further 

/ (f = ^> q , ol = (q — 1 (1 + (o — 2 e p ), 

^ W (g + e r + - 2) 2 (g + - l) 2 ' ^ A ^ ' 

with a > since g > 1. Notice that f"(r) > for r sufficiently negative and /"( r ) < for r 
sufficiently positive, with a single sign change occurring at (loga)/2 which is zero for q = 2 
and strictly positive for q > 2. This proves that / has between one and three fixed points. 
When B = 0, one fixed point is always given by r f (/3,0) = 0. Further f(r; 0, 0) ee 0, so (by 
monotonicity of /' in f3) there exists oo > /3_ > such that /' < 1 everywhere for /3 < /?_ 
and /' exceeds 1 somewhere for /3 > 
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Solving the equation f'(r) = 1 in terms of t = e r yields solutions 

d. 



t ± {(3) = -7 ± vV - «, 7 = e p + q - 2 - -(1 - e~ p )(e p + q-l). 



Since a > 0, £±(/3) are not positive if 7 > — y/a, equal to y^a > if 7 = —y/a, and positive 
but not equal if 7 < — y/a. If d > 2, it is easy to check that both a and 7 decrease smoothly 
in (3, starting at 7^=0 = q — 1 and a|^=o = (q — l) 2 , so there is a unique value (3 = /3_ > at 
which 7 = — y/a: if d = 2 then /3_ = 00, and if d > 2 then /?_ is the logarithm of the unique 
finite positive root &_ of 

(d - 2)V + (d - 2) 2 (q - 2)6 - d 2 (q - 1) = 0. (4.8) 

Hence, the equation f'{r) = 1 has no solutions for /?</?_, and it has solutions p±(f3) = 
logt±(/3) for /3 > /3_, with p_(/3_) = p+(/3_) and p_(/3) < p+(/3) for (3 > /5_. The values of 
5_(/3), B + (j3) are then given explicitly by 

B ± (P) = p T (P)-f(p T (l3y,P,0), (4.9) 

which clearly meet at /3 = /3_ and are smooth for /3 > □ 

Considering hereafter only d > 2 (so that /?_ < 00), suppose /3 > so that the func- 
tions p± are defined. Since ^[/'(r)] > 0, p_ and p + must be, respectively, decreasing and 
increasing in /3. Further, since / has a unique inflection point at (loga)/2, we must have 
p~{0) < (log of) /2 < p+(/3), with strict inequalities unless p_(/3) = P+(/?). For g = 2 (Ising), 
this implies p_ < < p + from which it is easy to see that whenever B > we have 
r f (/3, 5) = r 1 ( J 5, 5), which is then continuous in (/3, 5) by the same argument as in the proof 
of Thm. 1.11. When B = 0, r f (/3,0) is zero for all /3, while r l (f3,B) is zero for /3 < /3_ and 
strictly positive for /3 > /3_. 

For g > 2 (Potts), this implies that p + (/3,B) > while p_(/3,B) > if and only if 
/'(0; -B) < 1. From the calculations above, /'(0) is zero at = and increases in /3. We 
therefore define 

f3 { = inf{/3 > : /(r; /3, 0) = r for some r > 0}, 

/3 + = inf{/3 > : p_(/3) < 0} = inf {/3 > : /'(0; /?, 0) > 1} = log (l + ^) (4.10) 

(where the formula for (3 + comes from (4.7)). Clearly /?_ < /9f < (3 + , and in fact these 
inequalities are strict: at (3 { , f must exceed one between zero and the positive fixed point, 
so < (3f. 2 Likewise, if f'(0) > 1 at (3 = (3 { , the concavity of f(r) at r = would imply the 
existence of a positive fixed point at some (3 below p\ which is a contradiction, so /3f < (3 + . 
We refer again to Fig. 1 which shows the maps /(r; (3, B) for the Ising and Potts models at 
several values of (3 while holding B = 0. Fig. 2b shows the regime of {/3,B) values delineated 
by the curves B±((3). 

Proof of Thm. 1.10. (a) We found above that TZ-l = for d = 2 and TZ-l = (/3_,oo) for 
q = 2, so suppose d,q > 2. If 5 > 0, r f = r 1 holds for all /3 > with /3 ^ (/3_,/3+). For 
/3 G (/?_, (3 + ) there is a closed interval [B_((3) V 0, _B + (/3)] of B values for which r f < r 1 : this 



2 Note that r 1 (/3f,0) > 0, i.e., the 1-biased fixed point "arises discontinuously." 
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interval is strictly positive for /3 < /3f and includes zero for P > Pf. If B = 0, r = r 1 for 
< P < 0t and r f < r 1 for /5 > ft. Recalling (4.9), 

dpBjfi) = d?ip T ((3)-f(p T ((3))} = [l-fip^iP))]dfi/^ifi)-W)iP^m = -Qpf){f*{P))- 
This has the same sign as — p T ((3), which are both negative for < (3 < (3 + , so the curves 
B±(f3) are decreasing. Inverting them gives the curves pf (B), P+{B) which delineate the 
region 7Z^ as described in the theorem statement, with /3f(0) = /3f and P+(0) = (3 + . 

(b) Away from the boundary of TZ^, h { and h l correspond to isolated zeroes of a smooth 
function, and so are continuous by the implicit function theorem. From part (a), any point 
of 1Z is connected to TZ^ by an interpolation path contained in 7Z, so applying Thm. 1.9 (a) 
verifies the Bethe prediction for (f3,B) G" 1Z^. 

Since changing B only translates /(r; P, B), it is not difficult to see that when (3 G (/3_, (3 + ), 
the function /i f (/3,£>) is continuous in B for B G [0, _£?+(/?)] while /i 1 (/3, i3) is continuous for 
5 G [-B_(/3) V 0,oo). It follows by Lem. 2.1 that for ((3,B) G <97^ with (3 = (3 { (B), 
B) = $(/?, 5, /i f ), while for (/?, 5) G d1Z + with /3 > (3+(B), B) = 5, h 1 ). 

Recall our convention that Po < Pi, Bo < B\. by Thm. 1.9 (b) we may interpolate in B 
from (P,B ) G 7?.^ to (/3,-Bi) G 7£ using the message h 1 , yielding liminf^oo (f) n (fi, B) > 
$(/3,£>, h 1 ) for (P,B) G 72.^. Likewise, we may interpolate in B from (P,B ) G K to 
(P,Bi) G 7^ using /i f (and once inside 7^ we may also interpolate in p using h { ), which 
gives liminf n _ >oo n (/3, J B) > $(/3,5,/i f ) for (P, B) G K^. 

Next, since /i f (/3,S) and /i 1 (/3,-B) are lower and upper semi-continuous respectively in p, 
and both are non-decreasing in p, for < B < B + we have that h f (P,B) "f tf^P+^B), B) 
as p t and h x {p,B) 1 h\p { {B),B) as /H p { (B). Again by Thm. 1.9 (b), we may 

interpolate in /3 from (Po,B) = (/3 f (£?),£?) G <97^ to (Pi,B) G 7?.^ using ft, 1 , and from 
(/3 ,5) G ^ to (A, 5) = (P+(B),B) G <97^ using /i f , giving 

lim sup n (/3, 5) < min{$ f (P, B), $ x (/3, B)}, (/3, 5) G 7^, 

n— >oo 

which completes the proof. □ 
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